Vol. XXXII. Parts III & IV 


BIOMETRIKA 


A JOURNAL FOR THE STATISTICAL STUDY OF 
BIOLOGICAL PROBLEMS 


FOUNDED BY 
W. F. R. WELDON, FRANCIS GALTON and KARL PEARSON 


EDITED BY 
EGON S. PEARSON 


IN CONSULTATION WITH 
HARALD CRAMER J. B. S. HALDANE 
R. C. GEARY G. M. MORANT 
MAJOR GREENWOOD JOHN WISHART 


Reprinted by offset-litho, 1952, 1960 


ISSUED BY THE BIOMETRIKA OFFICE 
UNIVERSITY COLLEGE, LONDON 


AND PRINTED AT THE 
UNIVERSITY PRESS, CAMBRIDGE 


PRINTED IN GREAT BRITAIN 


[Issued 22 April 1942] 








Volume XXXII. Parts III and IV April, 1942 





MEDICAL STATISTICS FROM GRAUNT TO FARR 


(Continued*) 
BY MAJOR GREENWOOD 


III. THE STATISTICAL WORK OF GRAUNT 


JOHN GRAUNT’S contribution to our subject has always been regarded as one 
of the great classics of science. A few have indeed doubted whether so great 
a work could have been achieved by one whose material success was so modest 
and have sought to transfer the glory to Graunt’s highly successful friend Petty. 
This dispute I relegate to an appendix. I assume that Graunt’s published book 
is substantially his own original work. 

The history of the material Graunt used has been written more than once 
and I have nothing to add to Prof. Hull’s story. Graunt had, for a period of more 
than 60 years, arithmetical statements of the numbers of males and females 
christened and buried and of the causes of death (not distinguished by sex) 
under some sixty headings. He had no information as to the ages at death. He 
had no information as to the number or ages of the living population. 

The first act of a scientific statistician is to assess the trustworthiness of his 
data, to criticize his sources. This tedious preliminary to the doing of sums was 
not much to Petty’s taste. Petty, as we have seen, often used different data to 
reach some conclusion, but hardly ever discusses the reliabilities of the several 
data. Other Fellows of our College since Petty’s day have made the same 
mistake. The terrible ‘howler’ committed by Dr William Heberden the younger, 
and detected, not without satisfaction, by Charles Creighton is classical.† But 
that was not a unique instance. Indeed, even trained statisticians sometimes 
confuse names with things. More than one rate of mortality has risen (or fallen) 
only on paper. Graunt made no such mistakes. 

Graunt’s general argument is that many causes of death are ‘but matters of 
sense’, for instance, whether a child were abortive or stillborn, and that in many 
cases the searchers are ‘able to report the opinion of the physician, who was 
with the patient as they receive the same from the friends of the defunct’. But 
sometimes the searchers will be wrong and often enough the error will not matter. 


As for consumptions, if the searchers do but truly report (as they may) whether the 
dead corpse were very lean and worn away, it matters not to many of our purposes whether 
the diseases were exactly the same, as physicians define it in their books. Moreover, in 
case a man of seventy-five years old died of a cough (of which had he been free, he might 


* The earlier sections were printed in Biometrika, 32, 101-27. 

† Creighton, History of Epidemics in Britain, 2, 747-8. Heberden supposed (erroneously) that 
‘Griping of the Guts’ of the Bills was Dysentery and had decreased. It was Infantile Diarrhoea 
and had simply been transferred to the rubric ‘Convulsions’. 












have possibly lived to ninety) I esteem it little error (as to many of our purposes) if this 
person be in the table of casualties, reckoned among the aged, and not placed under the 
title of coughs (348).* 


No doubt this brutal common sense might set on edge the teeth of some 
Fellows of the College of Physicians even in the seventeenth century, but it was 
one of the qualities which made Graunt a pioneer. Making the best the enemy 
of the good is a sure way to hinder any statistical progress. The scientific purist, 
who will wait for medical statistics until they are nosologically exact, is no wiser 
than Horace’s rustic waiting for the river to flow away. 

Graunt, however, did not accept statements which he had the means of 
testing. Finding in a series of years that of more than a quarter of a million 
deaths only 392 were assigned to the Pox, he did not infer that Syphilis had 
been over-rated as a cause of death. 


Forasmuch as by the ordinary discourse of the world it seems a great part of men have, 
at one time or other, had some species of this disease, I wondering why so few died of it, 
especially because I could not take that to be so harmless, whereof so many complained 
very fiercely ; upon enquiry, I found that those who died of it out of the hospitals (especially 
that of Kingsland, and the Lock in Southwark) were returned of ulcers and sores. And in 
brief, I found, that all mentioned to die of the French Pox were returned by the clerks of 
St Giles’ and St Martin’s in the Fields only, in which places I understood that most of the 
vilest and most miserable houses of uncleanness were: from whence I concluded, that only 
hated persons, and such, whose very noses were eaten off were reported by the searchers 
to have died of this too frequent malady (356). 


In principle, the argument is still valid. 

His next example of criticism is the case of Rickets, which first appeared in 
the Bills of Mortality in 1634 and then with 14 deaths only, but by 1659 had 
risen to 441. Was Rickets a ‘new disease’ or did an old disease receive, in the 
Bills, a new name? 


To clear this difficulty out of the bills (for I dare venture on no deeper arguments) 
I enquired what other casualty before the year 1634, named in the Bills, was most like the 
rickets; and I found, not only by pretenders to know it, but also from other Bills, that 
livergrown was the nearest. For in some years I find livergrown, spleen, and rickets, put 
all together, by reason (as I conceive) of their likeness to each other. Hereupon I added 
the livergrowns of the year 1634, viz. 77, to the rickets of the same year, viz. 14, making 
in all 91; which total, as also the number 77 itself, I compared with the livergrowns of the 
precedent year 1633, viz. 82. All which showed me, that the rickets was a new disease over 
and above. Now, this being but a faint argument, I looked both forwards and backwards, 
and found that in the year 1629, when no rickets appeared there were but 94 livergrowns; 
and in the year 1636 there were 99 livergrowns, although there were also 50 of the rickets: 
only this is not to be denied, that when the rickets grew very numerous (as in the year 1660, 
viz. 521) then there appeared not above 15 of livergrown. In the year 1659 were 441 rickets 
and 8 livergrown; in the year 1658 were 476 rickets and 51 livergrown. Now though it be 
granted that these diseases were confounded in the judgment of the nurses, yet it is most 
certain that the livergrown did never but once, viz. anno 1630, exceed 100; whereas anno 


* Numbers in brackets are page references to Prof. Hull’s edition of The Economic Writings of 
Sir William Petty together with the Observations upon the Bills of Mortality more probably by Captain 
John Graunt, Cambridge, 1899. 




1660, livergrown and rickets were 536. It is also to be observed, that the rickets were never 
more numerous than now, and that they are still increasing; for anno 1649, there were but 
190, next year 260, next after that 329 and so forwards, with some little starting backwards 
in some years, until the year 1660, which produced the greatest of all (357-8). 


This is an excellent statistical argument, and, incidentally, evidence that 
Graunt wrote his own book, for a physician would probably have suggested that 
the professional interest excited by the classical treatise of Glisson (assisted by 
Regemorter) which was published in 1650 might easily have increased the 
popularity of the diagnosis. Petty, who, with Glisson, was a founder of the 
Royal Society, would hardly have ignored his colleague’s work. 

I cannot resist the desire to mention others which, while of little statistical 
importance, have a medical attraction. Graunt noticed that Stopping of the 
Stomach first appeared in the Bills of 1636, increased from 6 to 29 by 1647, by 
1655 it reached 145, in 1657, 277 and in 1660, 314. First he conjectured that 
Stopping of the Stomach might be the Green Sickness, ‘forasmuch as I find few 
or none to have been returned upon that account, although many be visibly 
stained with it’. He thought that possibly Green Sickness might not appear in 
the Bills ‘for since the world believes that marriage cures it, it may seem indeed 
a shame, that any maid should die uncured, when there are more males than 
females, that is, an overplus of husbands to all that can be wives’. Then he 
wondered whether Stopping of the Stomach might not be Mother, ‘forasmuch as 
I have heard of many troubled with Mother Fits (as they call them) although 
few returned to have died of them’. But he was diverted by guessing ‘rather 
the Rising of the Lights might be it’. He remembered that some women troubled 
with the Mother fits did complain of a choking in their throats. ‘Now, as I 
understand, it is more conceivable that the Lights or Lungs (which I have heard 
called the bellows of the body) not blowing, that is, neither venting out, nor 
taking in breath, might rather cause such a choking, than that the Mother should 
rise up thither, and do it. For methinks, when a woman is with child, there is 
a greater rising, and yet no such fits at all’ (359). He notes that Rising of the 
Lights increased in the Bills from 44 in 1629 to 249 in 1660. 

Finally, he suggests a correlation between Stopping of the Stomach, Rising 
of the Lights in adults and the Livergrown, Spleen and Rickets of children. 
‘And that what is the Rickets in children, may be the other in more grown 
bodies; for surely children which recover of the Rickets, may retain somewhat 
to cause what I have imagined: but of this let the learned physicians consider, 
as I presume they have’ (359). 

It might be suggested that one item under Stopping of the Stomach could 
be surgical, viz. strangulated hernia. Rupture was a heading in the Bills, but 
the numbers are small and show no regular increase with the increase of 
population. Graunt’s attraction to what used to be called hysterical stigmata is 
interesting. One wonders how far these passages reflect conversations with 
Petty. It is clear that Graunt had no belief in the peripatetic uterus; Petty 










would have had none. The best medical opinion of the age is, of course, that of 
Sydenham. Sydenham (whose pathology was traditional) had a pneumatist 
aetiology of Hysteria, the origin was an ataxia of the animal spirit (which was 
the pneuma zotikon of ancient tradition). He not only believed that Hysteria 
might be a serious or even mortal complication of organic disease—as we do 
still—but that the ataxic spirits might themselves produce humoral corruption 
and lead to chlorosis or ovarian dropsy (Dissertatio epistolaris, 92). So there is 
nothing repugnant to the best professional opinion of the age in admitting 
Hysteria to the list of causes of death. Nor is there any gross absurdity in the 
suggested correlation of increasing Rickets and increasing Hysteria, from the 
point of view of a layman. But that surmise does not imply any professional 
hint, it rather suggests a belief in a merely physical factor, the pressure of an 
enlarged organ. That passage would not have been written by a physician. 

These are sufficient instances of Graunt’s criticism of sources—the temptation 
to go on quoting examples must be resisted. I pass to his great achievement, 
the estimation of rates of mortality at ages when the numbers and ages of the 
living were not recorded. For such an estimation to be correct, we all know that 
the population must be stationary, viz. non-increasing, not subject to migration 
and having constant rates of mortality in the several age groups. 

It is a nice point whether Graunt or Petty appreciated the importance of 
these considerations. Graunt was certainly alive to the fact that the population 
of London was growing and that the growth was due to immigration from the 
country. The arithmetical position was this. In the earlier years of his series 
burials and christenings were about equal in numbers, in 1605 there were 5948 
burials and 6504 christenings; in 1625, 7850 burials and 7682 christenings, in 
1635, 10,651 burials and 10,034 christenings. Later the burials continued to 
increase, but the christenings either decreased or failed to increase in the same 
proportion. This Graunt attributed to neglect of christening owing to religious 
dissidence and gave excellent reasons for his view. It is clear then that there 
were two factors of increase, immigration and increasing numbers of births. 
Most of Graunt’s deductions are based upon an analysis of the deaths by causes 
for twenty years, 1629-36 and 1647-58, which he selected as years comparatively 
unaffected by plague (of his total of 229,250 deaths only 16,000 were from 
plague). 

If we treat this total as a denominator (or one-twentieth of it) it will, from 
the point of view of calculating mortality ratios, be affected by two errors. The 
deaths of immigrants will make it too large and the increasing births will make 
it too small. Can it be Graunt held that the errors balanced so that, arithmetically 
speaking, one might behave as if one were dealing with a stationary population? 
An alternative explanation is that Graunt did not realize the limitations of the 
method. 


A third possibility is that, although he knew the fallacy, he believed that 




the incorrect method gave an approximation to truth sufficient for his purposes. 
This is the solution I should be inclined to adopt were I forced to choose. 

As I have pointed out above, there is at least a suggestion that Petty did 
have some glimmering of the conditions to be fulfilled if a summation of deaths 
is to give a correct view of rates of mortality. I do not believe that Graunt was 
less informed on any point of vital statistics than Petty. However, all this is 
guess-work. 

Graunt did not know the ages of the dead; what he did was to pick out 
of the list of causes of deaths those which he thought lighted only upon children 
‘not more than four or five years old’. He chose Thrush, Convulsions, Rickets, 
Teeth and Worms, Abortives, Chrysomes, Infants, Livergrown and Overlaid. 
These gave him some 70,000 out of some 229,000. Then he assigned half the 
deaths from Small Pox, Swine Pox, Measles and Worms without Convulsions 
also to children under six and reached the final conclusion that about ‘36% of 
all quick conceptions die before six years old’. 
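
As a rough check on the arithmetic just described, the following sketch uses only the rounded totals quoted above (some 70,000 out of some 229,000). The size of the half-share of Small Pox, Swine Pox, Measles and Worms is not given in this text, so the figure named implied_extra is back-calculated from the 36% conclusion and is an assumption, not a number taken from Graunt.

```python
# Rounded totals quoted in the text; both are approximate.
TOTAL_DEATHS = 229_000        # all deaths in the twenty selected years
CHILD_CAUSE_DEATHS = 70_000   # deaths under the causes Graunt assigned wholly to children

# Share of all deaths falling under the purely childhood causes.
child_share = CHILD_CAUSE_DEATHS / TOTAL_DEATHS

# Assumption: back-calculate how many further child deaths the half-share of
# Small Pox, Swine Pox, Measles and Worms must supply to reach 36%.
TARGET_SHARE = 0.36
implied_extra = TARGET_SHARE * TOTAL_DEATHS - CHILD_CAUSE_DEATHS

print(f"childhood causes alone: {child_share:.1%}")
print(f"further child deaths implied by 36%: {implied_extra:.0f}")
```

On these rounded figures the childhood causes alone account for rather less than a third of all deaths, and the half-share would need to add something over 12,000 deaths to bring the proportion to 36%.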

Is this conclusion (I will not say correct, because we have no data to reach 
a correct result) of a reasonable order of magnitude? The answer is that 
it is eminently reasonable. Two hundred years after Graunt’s death, William 
Farr printed (in the famous Supplement to the 35th Annual Report of the Registrar- 


puted by an approximately correct method, using knowledge of the numbers 
and ages of the living population, and reflects the conditions of seventy-five 
years ago. Interpolating in this we find that about 32% of ‘quick conceptions 
died before six years old’. There is no good medical reason for holding that the 
conditions of child life in London in the middle of Victoria’s reign were much 
better than in the seventeenth century. The old genius used a bow with a frayed 
string and made no allowance for windage, but his arrow hit the target not far 
from the white. He gave the first quantitative measure of the Herodian sacrifice 
in towns, a sacrifice which was to continue to be offered for more than 200 years. 

Graunt then passed to the other end of life and found that 7% of the dead 
were ‘aged’. He conceived that the searchers would mean by ‘aged’ persons of 
70 years or upwards, ‘for no man can be said to die properly of Age who is much 
less’. His following suggestion that the proportion living beyond 70 might be 
used as a measure of healthfulness is not happy. But this calculation may have 
led him to make, or insert, the most famous passage in his book, viz. what is, 
in form, the first Life Table ever published. 

Whereas we have found, that of 100 quick Conceptions about 36 of them die before 
they be six years old, and that perhaps but one surviveth 76; we having seven decads 
between six and 76, we sought six mean proportional numbers between 64, the remainder, 
living at six years, and the one, which survives 76, and find that the numbers following 


are practically near enough to the truth; for men do not die in exact proportions, nor in 
fractions, from whence arise this Table following (386). 


Graunt’s figures are 100, 64, 40, 25, 16, 10, 6, 3, 1. 










The one survivor to 76 is, as Graunt implies, a guess; perhaps he conjectured 
that his seven survivors beyond 70 died one a year. How he calculated his mean 
proportional numbers is unknown. Prof. Willcox conjectured that he experimented 
with multipliers of 5/8 and 2/3; the former nearly reproduces the figures 
(see Willcox, Revue de l’Inst. Intern. de Statistique, 5 (1937), 327). Ptoukha 
(Congrès Intern. de la Population; Démographie historique, p. 71, Paris, 1937) 
ingeniously suggests that he used the multiplier (64 − 1)/100 or 0.63. 
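
The conjectured multipliers can be compared with Graunt's column by a short sketch. It applies each constant multiplier repeatedly to the 64 survivors at age six and prints the unrounded series. The function name and the way the candidates are labelled are illustrative assumptions; how Graunt himself rounded, or whether he used a constant multiplier at all, is unknown.

```python
def geometric_survivors(start, ratio, steps):
    """Return start followed by `steps` further terms, each `ratio` times the last."""
    values = [float(start)]
    for _ in range(steps):
        values.append(values[-1] * ratio)
    return values

# Graunt's survivors at ages 6, 16, 26, ..., 76.
GRAUNT = [64, 40, 25, 16, 10, 6, 3, 1]

# The ratio that would carry 64 down to exactly 1 in seven equal steps,
# i.e. true mean proportionals between 64 and 1 (about 0.55).
exact_ratio = (1 / 64) ** (1 / 7)

candidates = {
    "exact mean proportionals": exact_ratio,
    "5/8 (Willcox)": 5 / 8,
    "2/3 (Willcox)": 2 / 3,
    "0.63 (Ptoukha)": 0.63,
}

print(f"{'Graunt':26s}", GRAUNT)
for label, ratio in candidates.items():
    series = geometric_survivors(64, ratio, 7)
    print(f"{label:26s}", [round(v, 1) for v in series])
```

The first candidate shows what strict mean proportionals between 64 and 1 would look like; the others let the reader judge how closely each conjectured multiplier follows the published figures.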

We must, I fear, conclude sorrowfully that this shot did not find the bull’s 
eye. If Graunt’s survivors are compared with those shown in Halley’s table 
(when correctly used, vide infra), for 100, 64, 40, 25, 16, 10, 6, 3, 1, we should 
have 100, 56, 50, 45, 38, 31, 22, 14, 6. It is possible that child mortality was 
lower in London than in Breslau, but quite incredible that later age mortality 
should have been so enormously higher. 
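
The contrast between the two columns is easier to see as the proportion surviving each interval. The sketch below is illustrative only: it takes the two series exactly as printed above and assumes both refer to the same ages (0, 6, 16, ..., 76); the function name is hypothetical.

```python
GRAUNT = [100, 64, 40, 25, 16, 10, 6, 3, 1]
HALLEY = [100, 56, 50, 45, 38, 31, 22, 14, 6]
AGES = [0, 6, 16, 26, 36, 46, 56, 66, 76]  # assumed common age points

def interval_survival(survivors):
    """Proportion of those alive at one age point who reach the next."""
    return [later / earlier for earlier, later in zip(survivors, survivors[1:])]

graunt_p = interval_survival(GRAUNT)
halley_p = interval_survival(HALLEY)

for start, end, g, h in zip(AGES, AGES[1:], graunt_p, halley_p):
    print(f"ages {start:2d}-{end:2d}: Graunt {g:.2f}  Halley {h:.2f}")
```

Only in the first interval does Graunt's column show the better survival; in every later interval the proportion surviving is lower than in Halley's table.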

But, of course, having regard to the data, it would have been more than 
genius, it would have been magic, had a correct result been obtained. 

Prof. Willcox, whose opinion of Graunt is almost as high as mine, regards the 
passage as inserted on the recommendation of Petty and as Petty’s composition. 
He thinks that it lacks Graunt’s caution and suggests the flighty ingenuity of 
his friend. Prof. Willcox’s arguments are weighty, but I am not convinced. 
That Graunt did not, to use the expressive slang, feature his table is true. It is 
also true (vide supra) that passages in Petty’s undoubted writings imply that 
he had some conception of a survivorship table. But—and this is my main 
difficulty—if this were Petty’s idea, I find it difficult to believe that he would 
not have exploited it. Halley, whose economic scent was not so keen as Petty’s, 
saw the epoch-making importance of an idea which was to transform the business 
of selling annuities. It would be odd, if Petty had seen it, that he did not comment 
upon it. Graunt might well have hesitated, being a cautious statistician, but 
surely not Petty. 


However, in spite of modern practice, the writing of history wholly in terms 
of psychology has its pitfalls. 

Let us return to simpler applications of shop arithmetic. The advantages of 
country life over town life from the point of view of both mortality and morality 
had been a commonplace of poets, particularly those Roman poets who spent 
much of their lives in a city, long before the seventeenth century. Graunt was 
the first to apply an arithmetical test of mortality; he compared the statistics 
of Romsey with those of London. For Romsey he had ninety years’ data of 
marriage, christenings and burials. 

His statements about the population of the parish are not quite consistent. 
In one sentence he says that it ‘both 90 years ago, and also now, consisted of 
about 2700’, but a few lines later says ‘it neither appears by the burials, 
christenings, or by the built of new housing, that the said parish is more populous 
now, than 90 years ago, by above two or 300 souls’. A little later he says ‘it is 




clear that the said parish is increased about 300, and it is probable that 3 or four 
hundred more went to London; and it is known that about 400 went to New 
England, the Caribe Islands and Newfoundland within these last forty years’ 
(389). Actually, from an estimate of the number of communicants (which he 
assumes to be rather more than half the total population) he makes the average 
population between 2700 and 2800. Taking the average of burials for the whole 
period to be 58, this gives him a death rate of a little more than one in 50, 
which he contrasts with the London figure of one in 32 (apparently based on his 
count of 11 families with 88 persons amongst whom 3 deaths occurred in a year; 
but this is a rate of one in 29). 
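
The rates in this paragraph can be recomputed from the figures quoted. This is a minimal sketch; the function name is an assumption, and the population bounds are the 2700 and 2800 of the text.

```python
def one_in(population, deaths_per_year):
    """Crude death rate expressed as 'one death per N persons per year'."""
    return population / deaths_per_year

# Romsey: average of 58 burials a year on a population of 2700 to 2800.
romsey_low = one_in(2700, 58)
romsey_high = one_in(2800, 58)

# London: Graunt's count of 3 deaths in a year among 88 persons.
london_sample = one_in(88, 3)

print(f"Romsey: one in {romsey_low:.1f} to one in {romsey_high:.1f}")
print(f"London sample: one in {london_sample:.1f}")
```

The Romsey figure comes out between one in 46 and one in 49, which is the sense of 'a little more than one in 50', and the London sample gives one in 29 rather than one in 32.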

There is no doubt a certain sketchiness about this, but it was not unreasonable 
to infer that the Romsey rate was much lower than the London rate. 

Graunt found that, unlike London, Romsey had an average excess of 
christenings over burials, they were in the ratio of 5 to 4. He estimates that over 
the period the natural increase was 1059, and, as will be seen from the quotation 
made, he allots about a third of this respectively to London, to the colonies and 
to the parish itself. He argues that supposing the population of all England to 
be fourteen times that of London and other parishes to send one-third of their 
natural increase to London, then the London burials should increase about 
200 per annum ‘and will answer the increase we observe’. 

Here again the argument is reasonable. He goes on to an investigation which 
has been severely criticized. He gives a table of the greatest and least number 
of burials in each of the ten-year periods for which he has data. In each decade 
but one the maximum is more than twice the minimum. But, he remarks, in 
no decade in the London experience is the largest number of burials twice the 
smallest number (he excludes deaths from plague from his statistics). ‘Which 
shews, that the opener and freer airs are most subject both to the good and bad 
impressions, and that the fumes, steams and stenches of London do so medicate 
and impregnate the air about it, that it becomes capable of little more, as 
if the said fumes rising out of London met with, opposed and jostled backwards 
the influences falling from above, or resisted the incursion of the country 
airs’ (392). 

Prof. Hull shook his head over this passage. ‘This is an attempt to explain 
by physical conditions the wide range in the observed country death rate which 
is really due to the narrowness of the field—a single market town—under 
investigation. It is perhaps the gravest statistical mistake that can be charged 
against Graunt’ (lxxvii). 

I do not like to leave a hero in the lurch. I must concede that if both Romsey 
and London burials were samples from a Poisson universe, the fact that the 
Poisson parameter for London was at least a hundred times that for Romsey 
would make it incredible that the London range, in terms of the mean or of the 
standard deviation, should be so wide as that for Romsey. But Prof. Hull was 













wrong in supposing that the wide range in the Romsey rates was due to the 
narrowness of the field of observation in a statistical sense. 

Taking Graunt’s 58 as the ‘expected’ annual deaths then, as 1/58 is small, 
the Poisson distribution is not far from the symmetry of a normal curve, and 
using the results of Tippett and E. S. Pearson, we may conclude that the 
expected range in a decade would be 23.45 ± 6.073. The observed ranges for the successive 
decades are 32, 48, 78, 23, 65, 39, 121, 91, 52. All but one is greater than the 
expectation and six diverge by more than three times the standard error. 
Something more than small numbers is involved. Still, it must be confessed 
that Graunt did not anticipate the reasoning of James Bernoulli, although an 
intuition of genius may have led him to think that something more than ‘chance’ 
had play here. 
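
The comparison of observed and expected ranges can be sketched as follows. The constants 3.078 and 0.797 are the standard tabulated mean and standard deviation of the range of ten normal observations, in units of the population standard deviation; they are assumed here and are not quoted in the text. The normal approximation to a Poisson variable with mean 58 is likewise taken from the argument above.

```python
import math

EXPECTED_BURIALS = 58   # Graunt's average annual burials at Romsey
D2_N10 = 3.078          # assumed: mean range of 10 normal observations / sigma
D3_N10 = 0.797          # assumed: s.d. of that range / sigma

# Observed range (greatest minus least annual burials) in each decade.
OBSERVED_RANGES = [32, 48, 78, 23, 65, 39, 121, 91, 52]

# For a Poisson variable the variance equals the mean.
sigma = math.sqrt(EXPECTED_BURIALS)
expected_range = D2_N10 * sigma
range_se = D3_N10 * sigma

above_expectation = [r for r in OBSERVED_RANGES if r > expected_range]
beyond_three_se = [r for r in OBSERVED_RANGES
                   if r - expected_range > 3 * range_se]

print(f"expected range: {expected_range:.2f} +/- {range_se:.2f}")
print(f"decades above expectation: {len(above_expectation)} of {len(OBSERVED_RANGES)}")
print(f"decades more than 3 s.e. above: {len(beyond_three_se)}")
```

The small differences from the printed 23.45 and 6.073 arise only from rounding the standard deviation; the counts of decades agree with those stated.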

Graunt devoted special attention to the demographic influence of the plague. 
In the first place, he remarks that the attribution to plague understated the 
mortality due to plague. He infers this from the fact that in plague years burials 
from other causes exceeded the average greatly, ‘from whence we may probably 
suspect, that about 1/4 part more died of the plague than are returned for such’. 
Next he inferred that after a great outburst plague lingered for several years. 

The plague of 1636 lasted twelve years, in eight whereof there died 2000 per annum 
one with another, and never under 300. The which shows that the contagion of the plague 
depends more upon the disposition of the air than upon the effluvia from the bodies of 
men. Which also we prove by the sudden jumps which the plague hath made, leaping in 
one week from 118 to 927; and back again from 993 to 258; and from thence again the very 


next week to 852. The which effects must surely be rather attributed to change of the air, 
than of the constitutions of men’s bodies, otherwise than as this depends upon that (366). 


Finally, he observes that within two years the city was re-peopled; a deduction 
from the time taken for the number of christenings to reach again the level of 
a pre-plague year. 

We may, if we please, smile at Graunt’s epidemiological inference. But it is 
a reasonable inference from the facts when we remember that in Graunt’s day— 
in spite of Fracastorius—contagium was not thought of as contagium vivum, 
but as a mere sympathetic vibration or passing on of something. 

I have, I hope, given an adequate sample of Graunt’s quality, but have not 
mentioned the most famous of all his deductions. Both in London and the 
country, on the average more males were christened than females, but more 
males died young or entered celibate occupations. So we reach this conclusion: 


We have hitherto said, there are more males than females; we say next that the one 
exceed the other by about a thirteenth part. So that although more men die violent deaths 
than women, that is, more are slain in wars, killed by mischance, drowned at sea and die 
by the hand of justice; moreover more men go to colonies and travel into foreign parts 
than women; and lastly, more remain unmarried than of women as fellows of colleges, 
and apprentices above eighteen etc. yet the said thirteenth part difference bringeth the 


business but to such a pass, that every woman may have an husband, without the allowance 
of polygamy (375). 




The story of how this arithmetical justification of God’s providence attracted 
the attention of Derham, of how Derham’s book fired the enthusiasm of the 
Prussian Army chaplain Johann Peter Süssmilch and of how Süssmilch’s book 
influenced Malthus has been well told by Hull. I should not myself rank this 
section high among Graunt’s researches. From a demographic point of view 
neither judicial hangings nor college fellowships could have had much effect in 
reducing the male excess. 

Even copious quotation fails to convey the spirit of a complete book. I have 
quoted good things, but many more remain. Graunt revealed sundry important 
truths and not the least important was that very imperfect data, if patiently 
considered, will tell us something it is good for us to know. If young medical 
officers going to parts of the empire where organized medical and demographical 
information is at no higher a level than that of seventeenth-century England— 
and there are many such places—were restricted to a single book on statistics, 


I should advise them to take not a modern scientific work, but old John Graunt’s 
Observations. 


APPENDIX 
Did Graunt write the book published over his name? 


John Graunt and William Petty were, as we have seen, close friends. Graunt, 
as the world judges success, failed and Petty succeeded. But by the judgment 
of scientific men in the seventeenth century, and ever since, the order of 
intellectual precedence was reversed. From the moment of publication a few 
discerning people perceived the originality and importance of the Observations, 
the same people who, while admiring Petty’s verve, ingenuity and worldly 
success, did not take over-seriously his bright ideas. 

But Graunt was a man of one book. Save a note upon the multiplication of 
carp and the growth of salmon, he published nothing more. Petty went on 
writing, scheming and talking for thirteen years after Graunt’s death. That often 
enough in that period, the Observations were discussed over the wine—as they 
were quoted in Petty’s writings—we may suppose. That Graunt discussed his 
work with Petty both before and after publication we may also take for certain, 
although we have no formal proof of it. The country statistics which Graunt 
first used were from Petty’s native parish and even if we are not disposed (as 
certainly I am not) to give much weight to particular turns of phraseology, still 
there are sufficient verbal oddities in some pages of Graunt’s book to suggest 
Petty’s hand. 

In these circumstances, it would not be very surprising if Petty’s associates, 
particularly those who were not good judges of statistical work, were to conclude 
that Petty’s share in the remarkable achievement of Graunt were greater than 













appeared. It is not even judging Petty too harshly to suppose that he himself 
might come to share the opinion. There is no evidence that Petty ever did 
explicitly claim the credit. In one list of his writings (one of four), found among 
Petty’s Papers, he did include the Observations, which at least is evidence that 
he thought himself entitled to a share of the credit. We may suppose that if, 
in familiar intercourse, somebody had said ‘Come, confess Sir William, yours 
was the hand that guided the pen of poor John Graunt’, he might not have 
denied it very strenuously. I think I have produced evidence enough that Petty 
did not under-rate his powers and was not conspicuous for delicacy of feeling. 
My guess would be that long before his death he did come to believe that 
Graunt’s intellectual success was due to his help. 

Whether Petty believed this or not, it is certain that friends and associates 
of Petty began to believe it soon after Graunt’s death, and the belief has been 
entertained by a few people in each generation since. These, with one conspicuous 
exception, have been drawn from Petty’s friends or descendants or 
from literary critics. 

In the seventeenth century, of Petty’s circle, Evelyn, Southwell and Aubrey 
believed or said that Petty wrote or inspired Graunt’s book. Two Fellows of the 
Royal Society, Houghton and Halley, also attributed the book to Petty. The 
only one of the five who was certainly a competent judge of scientific merit was 
Halley. Halley began the memoir which contains his Breslau table with these 
words: 


The contemplation of the mortality of mankind has, besides the moral, its physical and 
political uses, both which have some time since been most judiciously considered by the 
curious Sir William Petty, in his moral and political Observations upon the Bills of Mortality 
of London, owned by Captain John Graunt. And since in a like treatise on the Bills of 
Mortality of Dublin....But the deductions from those bills of mortality seemed even to 
their authors (sic) to be defective. (Phil. Trans. no. 196 (1693), p. 596.) 


Since the seventeenth century, there has been unanimity among demographic 
statisticians and economists that Petty could not have written Graunt’s 
book. Halley was quite as good a judge of scientific merit as any of them and 
a contemporary of the canvassed writers; if I were sure that he had read and 
compared Petty’s acknowledged works with the Observations I should prefer his 
opinion to that of other ‘experts’—including, of course, my own. Halley’s 
direct testimony, in the sense of a court of law, would be valueless; he was only 
six years old when the Observations were published and became a Fellow of the 
Royal Society five years after the death of Graunt. There is no evidence that 
either before or after the period of writing and publishing his famous memoir, 
Halley worked on demography. After his memoir, but in his lifetime, a new 
epoch in mathematical vital statistics began. De Moivre, eleven years younger 
than Halley, brought out his principal works in the lifetime of Halley (1656- 
1742) and used Halley’s table. The two men must have been well acquainted,
for both were enthusiastic disciples and intimate friends of Newton, but Halley,
like Graunt, made only one contribution to the literature of demography. 

So it may be doubted whether Halley were sufficiently interested in demo- 
graphic or economic writings to have read Petty’s tracts at all. Also in the 
passage cited above (apart from the writing of ‘authors’ not ‘author’) the collo- 
cation of the Observations on the Dublin Bills with those on the London Bills is 
curious. There is no doubt that the Observations on the Dublin Bills were the 
work of Petty, and in the first edition of them they are stated on the title-page 
to be by the ‘Observator on the London Bills of Mortality’. But this, as Prof. 
Hull pointed out (xlii), was probably a catch-penny device of the publisher, 
Mark Pardoe, to draw a public which had just taken a fifth edition of the London 
Observations. Actually the book did not sell, and when the publisher reissued an 
enlarged version, Petty’s name appeared on the title-page without any reference 
to the London Observations. I conclude that Halley’s evidence is less weighty 
than it seemed. He may well have had before him copies of Graunt’s book and 
of the two editions of the Dublin Observations. Having no other knowledge of 
the literature he would naturally enough write as he did. 

If we eliminate Halley, no other expert countenanced Petty’s authorship and 
one, Augustus de Morgan, gave an amusing but quite cogent reason for dis- 
missing the notion. 


In speaking of the variations in the annual numbers of deaths attributed to 
Rickets, Graunt said: 


Now, such back-starting seem to be universal in all things: for we do not only see in the 
progressive motion of wheels of watches, and in the rowing of boats, that there is a little 
starting or jerking backwards between every step forwards, but also (if I am not much 
deceived) there appeared the like in the motion of the moon, which in the long telescopes 
at Gresham College one may sensibly discern (358). 


De Morgan (Budget of Paradoxes, 68; Assurance Magazine, 8, 167) commented 
on the improbability that ‘that excellent machinist, Sir William Petty, who 
passed his day among the astronomers’ would attribute to the motion of the 
moon in her orbit all the tremors which she gets from a shaky telescope. 

Down to 1927 the matter was regarded, in scientific circles, as settled. In 
that year the late Marquis of Lansdowne published a copious selection of the 
Petty Papers with what he regarded as new evidence in favour of Petty. 

The only new evidence of a direct kind was a manuscript list in Petty’s hand 
of his writings or projected writings which included the Observations. There are 
three other lists which do not include the Observations, and if we are to suppose 
that the entry really referred to the book published under Graunt’s name, then 
we must believe that in 1685 and in 1686 Petty had forgotten his best title to 
scientific immortality. The remainder of the evidence consists of parallel passages 
and ad captandum arguments to the effect that it was more probable that a
physician had written on questions of medical statistics than a tradesman. This
publication led to a lively controversy. Of the merits of this, I, as a party to it,
am not an impartial judge. Purely literary arguments do not appeal to me when 
the question is of scientific method. Thus, Dr L. F. Powell attached weight to 
the fact that Dr Johnson in conversation had attributed to Petty an observation 
(not statistical) which is made not in Petty’s writings but in Graunt’s book. 

In the discussion the word ‘style’ is used in different senses by the combatants. 
The statisticians are thinking of scientific method, the literary critics of verbal 
arrangement. To the former the fact that, particularly in the conclusions and 
the Appendix, Graunt’s book has turns of phraseology which suggest Petty’s 
hand, seems of little importance. To the latter it seems very significant. 

In the article by Prof. Willcox, which I have quoted above, the controversy 
is reviewed, and the author concurs generally with his statistical predecessors. 

Prof. Willcox does, however, differ from his predecessors in one important 
particular. He holds that the famous life table was supplied by Petty. He argues
that this is far too conjectural to have been the work of so cautious a reasoner
as Graunt:


In attempting to reconstruct its origin I have surmised that after Graunt had estimated 
that 36 per cent. of the deaths were due to children’s diseases, that they all occurred under 
the age of six, and that the seven per cent. who were reported to have died ‘aged’ died at 
over 70 years of age (at one place he says over sixty), he felt unable to go further and
reported his difficulty to Petty, already perhaps speculating about a series of similar
problems.


Petty guessed at the number of survivors at the end of each decennial age period, 
6-15, 16-25 etc. incidentally and characteristically ignoring Graunt’s theory that seven 
per cent. survived seventy, and assuming instead, without reason, that one per cent. survived
seventy-six and not one per cent. eighty-six, and that the survivors at age six decreased with 
each age period in a geometrical progression approximately equal to the 64 per cent. which 
Graunt had set for the first group (326-7). 


Prof. Willcox’s argument is cogent. It may be strengthened by a criticism 
of the late Prof. Westergaard (Contributions to the History of Statistics (London, 
1932), p. 23). In using this table, Graunt made a serious blunder. In order to 
estimate the number of men of military age in London he subtracted the number 
alive at age 56 from the number alive at age 16. But this simply gives him the 
number dying between those ages; what he wanted was some average of the l_x’s. It
is evident that Graunt was not at all clear in his mind as to how to use a life table. 
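
Both points can be checked with a short sketch. It assumes the figures of Graunt’s published table (survivors per 100 births at ages 0, 6, 16, ... 86), which the passage above does not reproduce, and it approximates the years lived in each interval by the trapezoidal rule, which is my simplification and not anything Graunt or Westergaard used. The function names are illustrative only.

```python
# Survivors per 100 births at the start of each age interval, as in Graunt's
# published table (assumed here; the passage above does not reproduce it).
AGES = [0, 6, 16, 26, 36, 46, 56, 66, 76, 86]
SURVIVORS = [100, 64, 40, 25, 16, 10, 6, 3, 1, 0]


def geometric_survivors(start=100, ratio_percent=64, steps=9):
    """Repeatedly take 64 per cent. of the previous figure, dropping fractions."""
    out = [start]
    for _ in range(steps):
        out.append(out[-1] * ratio_percent // 100)
    return out


def person_years(ages, survivors, lo, hi):
    """Approximate years lived between exact ages lo and hi (trapezoidal rule).

    lo and hi must both be ages listed in `ages`.
    """
    i, j = ages.index(lo), ages.index(hi)
    total = 0.0
    for k in range(i, j):
        width = ages[k + 1] - ages[k]
        total += width * (survivors[k] + survivors[k + 1]) / 2
    return total


# Graunt's figure: a difference of survivors, that is, deaths between 16 and 56.
graunt_figure = SURVIVORS[AGES.index(16)] - SURVIVORS[AGES.index(56)]

# Share of a stationary population aged 16 to 56: an average of the l_x's
# over that range, set against the same average over the whole of life.
share = person_years(AGES, SURVIVORS, 16, 56) / person_years(AGES, SURVIVORS, 0, 86)

print(geometric_survivors() == SURVIVORS)   # True
print(graunt_figure)                        # 34
print(round(100 * share, 1))                # 40.6
```

On these assumptions the difference of survivors gives 34 per 100 births, whereas the proportion of the living between 16 and 56 comes out nearer 41 per cent., which is the distinction Westergaard drew.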

On the other hand, if this table were really Petty’s idea, it is hard to under- 
stand why he did not exploit it. If Petty had been a Halley, the explanation 
would be obvious. The table is wrong; the conditions for the validity of the 
method were not fulfilled. There is indeed (vide supra) some evidence that Petty 
did know what data were necessary in order to construct a proper life table. 
One seems on the horns of a dilemma. If Petty thought the table was correct 
why did he make no further use of the method? If he thought it was wrong, 
would he have urged Graunt to insert it? 




Although Prof. Willcox has certainly shaken my previous conviction, I still
feel reluctant to surrender Graunt’s table to Petty. However, there may be an
element of sentimentality in this. At least the statisticians agree that the answer
to the question which I have placed at the head of this Appendix is emphatically
yes.


REFERENCES TO THE RECENT CONTROVERSY IN CHRONOLOGICAL ORDER 


1927. LANSDOWNE, Marquis of. The Petty Papers.
1928. GREENWOOD, M. J.R. Statist. Soc. 91, 79.
1928. LANSDOWNE, Marquis of. The Petty Southwell Correspondence, pp. xxiii-xxxii.
1932. LANSDOWNE, Marquis of. The Times Literary Supplement, 8 Sept.
1932. BRETT-JAMES, N. G. The Times Literary Supplement, 15 Sept.
1932. GREENWOOD, M. The Times Literary Supplement, 22 Sept.
1932. LANSDOWNE, Marquis of. The Times Literary Supplement, 13 Oct.
1932. POWELL, L. F. The Times Literary Supplement, 20 Oct.
1933. GREENWOOD, M. J.R. Statist. Soc. 96, 76.
1937. WILLCOX, W. F. Revue de l’Inst. Intern. de Statistique, 5, 321.


IV. HALLEY’S LIFE TABLE 


The long and fruitful life of Edmund Halley (1656-1742) belongs to the
general history of science; of him it may indeed be said nihil quod tetigit non
ornavit. He made only one contribution to our subject, but it was of first-rate
importance.


The circumstances of this undertaking are obscure; Halley would have 
perceived the imperfections of Graunt’s life table, but it is not known whether 
it was he who set on foot a search for better statistical material than Graunt 
had had. Inquiries were however made, and made after he had become a Fellow 
of the Royal Society, so it is at least possible that Halley, who had travelled 
extensively in Europe (he was at Danzig in 1679 and in Italy in 1681), suggested 
that something might be found abroad. By 1691, the King’s Librarian, Henry 
Justel, who was in touch with the Society, had been brought into communication, 
possibly through Leibniz, with Caspar Neumann, a scientifically minded evan- 
gelical pastor of Breslau. Neumann supplied the data which Halley used. 

In 1883, J. Graetzer, a medical-statistical official of Breslau, published a 
little monograph* which throws light upon the work. He not only extracted from 
the Breslau archives all the data which were or might have been communicated 
to the Royal Society but had the Society’s archives searched, with the result 
that a letter from Neumann to Justel and another from Neumann to Halley, 
both with statistical appendices, were discovered. Thanks to the labours of 
Graetzer and an essay by R. Böckh (Bulletin de l’Inst. Intern. de Statistique, 7
(1893), 1) we can form a reasonably clear idea of Halley’s method, which was 
not what those who have not examined the literature suppose it to have been. 

* Graetzer, J., Edmund Halley und Caspar Neumann, 1883.

It is often stated that Halley, having found that during the five years of
observation the number of births only slightly exceeded the number of burials,
treated the population as stationary and constructed a life table by a simple
summation of the deaths in the manner already explained. He was much wiser. 
What he tried to do was to construct a population table, in the following way. 
Suppose we know how many children were born in a calendar year, say 1690, 
in a town not subject to migration which maintains accurate registers of ages 
at death, and then we discover how many of the children born in 1690 will be 
alive on each successive first of January by a series of subtractions. We shall 
have the survivors on 1 January 1691 by subtracting those of the children born 
in 1690 who died in 1690. We shall have the survivors on 1 January 1692 by 
subtracting the deaths in 1691 occurring among the survivors to 1 January 1691, 
and so on. This will give a precise enumeration of the living population, and this 
is what Halley wanted. The figures we shall obtain will not be the conventional
l_x’s of a Life or (as German writers say) Mortality Table, but what in most
modern books are represented by the capital letter L or years of life lived or
“persons” living between the termini (see Appendix). If the population is
stationary, the sum of these figures gives the population of the place under
study. Now for ages between 1 (last birthday) and very advanced ages, L_x is
simply l_x diminished by ½(l_x − l_{x+1}). In the first year of life (and at advanced
ages) the difference is greater. Thus in the first year of life deaths are not evenly 
distributed throughout the year of life, more than 70% of them occur in the 
first six months of life, so that instead of subtracting half the deaths we must 
subtract nearly three-quarters. Halley himself assigned 68% of deaths in the 
first year of life to the first half of the first year. The reason why Halley proceeded 
in this way was that he knew the population not to be stationary. His idea was 
to obtain the figures for the first few years of life accurately—indeed just as 
they are now obtained—and then to correct for excess of births over deaths. 
His masterly plan was partly defeated by the fact that his Breslau
correspondent Neumann was not so good a statistician as he was. Halley’s letter to
Neumann has not been preserved; we have only Neumann’s answer of 1 March
1694. Probably Halley asked Neumann to send him (as a check on the calcula- 
tions he had already made) the exact numbers of survivors on 1 January for 
five years, of births in a calendar year. Neumann did send him a table, but the 
table, as Graetzer pointed out, is wrong. Neumann gave the correct figures for 
1 January of the first successive year, incorrect figures for the other years. 
Between 1 January of the year following the year the births of which are under 
study, and the next first of January, some of those born in the starting year will 
die under and some over a year of age. Neumann merely deducted the former, 
so he has too many survivors. To reach the right figure would have meant taking 
more trouble and he did not appreciate the importance of this. Böckh (whose
opinion of his statistical contemporaries has a tinge of bitterness rare, of course,
in other scientific pursuits) remarks that it was not strange Neumann should
miss the point as it had been missed by many statisticians long after his time.
It is at least clear that Halley had realized an important truth which did not
become part of even expert knowledge for more than a century. 
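
The point Neumann missed may be made concrete with a small sketch. The figures are invented for illustration and are not Neumann’s; the split of each year’s deaths into those under one year of age and those aged one or over follows the description above, and the function name is hypothetical.

```python
# Hypothetical cohort (not Neumann's figures): children born in calendar year 1690.
BIRTHS_1690 = 1000

# Deaths among that cohort in each calendar year, split by age at death.
COHORT_DEATHS = {
    1690: {"under_one": 200, "one_or_over": 0},
    1691: {"under_one": 60, "one_or_over": 40},
}


def survivors_on_1_january(births, deaths_by_year, count_all_ages=True):
    """Survivors of the cohort on each 1 January following a year of deaths.

    With count_all_ages=False only deaths under one year of age are deducted,
    which is the shortcut the text attributes to Neumann.
    """
    alive = births
    result = {}
    for year in sorted(deaths_by_year):
        d = deaths_by_year[year]
        alive -= d["under_one"] + (d["one_or_over"] if count_all_ages else 0)
        result[year + 1] = alive
    return result


correct = survivors_on_1_january(BIRTHS_1690, COHORT_DEATHS)
shortcut = survivors_on_1_january(BIRTHS_1690, COHORT_DEATHS, count_all_ages=False)

print(correct)    # {1691: 800, 1692: 700}
print(shortcut)   # {1691: 800, 1692: 740}
```

The two counts agree for the first 1 January and part company at the second, where the shortcut leaves too many survivors.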

The precise arithmetical details of Halley’s work are not perhaps of much 
medical interest. Graetzer and Böckh have done a good deal to clear it up. The
data used (an average of five years) had 1238 births and 1174 deaths and the 
table accounts for 1238 deaths. Halley must therefore have had a plan for 
increasing deaths. It is likely, from an observation he makes on mortality in 
Christ’s Hospital, that he did not wholly depend on the Breslau figures. Graetzer 
suggests that Halley may have made two graphs, one having an ordinate of 1238 
at the origin and an ordinate of 64 at the oldest age, the other an ordinate of 
1174 at the origin and 0 at the oldest age and that he plotted the survivors for 
each graph based on recorded deaths and drew a curve passing through 1238 
and 0 between these graphs. It may be so. Using the original material which 
Graetzer published, Böckh recalculated the table. The results do not, except at
ages over 60, differ materially from Halley’s. So far as concerns the mean after
life time (expectation of life), Halley’s table gives 27.54 years at birth, Graetzer’s
material 27.69. For ages under 40 the re-working gives slightly lower and for
later ages higher mortality. It may be noted that Halley’s table gives
appreciably higher mortality in childhood than Graunt’s, more than 43% instead of
36% are dead by the age of six years. But Graunt’s method would exaggerate
mortality (so would Halley’s method, but, owing to his precautions, not so 
greatly). On the other hand, Graunt’s estimate of age is only an intelligent 
guess. Actually, as Graetzer showed, the infant and child mortality shown by 
Halley’s table differed little from the observed rates of mortality in the city of 
Breslau in 1876-80. 

It has been said that Halley was not greatly interested in the medical aspects 
of his work. After describing methods of calculating the prices of annuities, he 
has the following passage* : 


It may be objected that the different salubrity of places does hinder this proposal from 
being universal; nor can it be denied. But by the number that die being 1,174 per annum 
in 34,000 it does appear that about a 30th. part die yearly as Sir William Petty has 
computed for London; and the number that die in infancy is a good argument that the air 
is but indifferently salubrious. So that by what I can learn, there cannot perhaps be one 
better place proposed for a standard. At least ‘tis desired, that in imitation hereof the 
curious in other cities would attempt something of the same nature, than which nothing 
perhaps can be more useful. 


That the mortality of childhood depends upon the atmosphere is not so 
foolish an hypothesis as it may seem to us. Halley lived before breast-feeding 
became the exception rather than the rule. The ‘curious’ in other cities had not 
the wit to follow his advice. He made no other contribution to the science of 
vital statistics; a gain to astronomy but a heavy loss to demography. 


* I have read Halley’s paper in the collection of papers, many by him, collected under 
the title Miscellanea Curiosa, printed in London in 1705; the quotation is from p. 300. 













APPENDIX 


Halley’s table is printed in two columns, the first headed ‘Age current’, the 
second ‘Persons’. Thus: 


Age current    Persons

     1          1000
     2           855
     3           798

and so on. 


A mistake sometimes made is to suppose that Halley meant by age current 
simply the end of each year of life and that the entry against each ‘age current’ 
is the number of survivors at exact age one year less than the ‘age current’, 
viz. that of 1000 born 855 survived to the first anniversary, 798 to the second 
anniversary, etc. The fact that Halley uses the round number 1000 for a first 
entry does something to encourage the mistake among readers who have not 
consulted the original paper and it is sometimes made by people who should 
know better. It is actually a terrible ‘howler’, leading to a wholly false view of 
rates of mortality in early life. Thus if 1000 and 855 were really the first two 
entries of a Life Table as set out now, then, as the first two entries in English 
Life Table no. 7 Males (mortality 1901-10) are 1000 and 856, we might conclude
that mortality in the first year of life was no lower in 1901-10 than in Breslau
in the last years of the seventeenth century. But the 1000 of Halley’s table is
not the number of new-born children but the average number out of 1238 born
living between the ages of 0 and 1. This is what is called the L_x of a modern
table or the population living between the ages x and x+1. If we have a column
of L_x’s, which is what Halley gives us, we can deduce therefrom the more
familiar l_x’s provided we know the starting value and the number of deaths in
the first year of life. Halley gives both items. He says that of 1238 annual births
348 die annually. So that his l_0 is not 1000 but 1238, and his l_1 is 890. He chose
1000 for L_0 by assuming that of the 348 deaths in the first year of life 238 occurred
in the first six months of life, 68%. This differs very little from the modern
practice; in Life Table no. 7 quoted above 73.5% of the deaths in the first year
of life are assigned to the first six months of life. Having been given l_0 and l_1
we can deduce the other l’s from the values of the L’s which Halley gives
because, after the end of the first year of life there is little error in supposing
that the deaths between two birthdays are evenly distributed over the year:
so, for instance, l_2 will be equal to L_1 less half the difference between l_1 and l_2,
and proceeding in this way we put Halley’s table into modern form. I attach
a table calculated by Böckh.

It will be seen that, if Halley’s table is properly used, the comparison is not 
of 1000 and 855 with 1000 and 856 but of 1000 and 719 with 1000 and 856. 
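
The conversion just described can be sketched as follows, using only the figures quoted in this Appendix (1238 births, 348 deaths in the first year, 238 of them in the first six months, and the entries 855 and 798). Here l_x stands for survivors at exact age x and L_x for Halley’s ‘persons’ living between ages x and x+1; the function name is my own, and the even spread of deaths after the first year is the assumption stated in the text.

```python
def lx_from_Lx(l0, l1, L_after_first_year):
    """Recover survivors at exact ages (l_x) from Halley's 'persons' (L_x).

    Assumes deaths are spread evenly between birthdays after the first year,
    so that L_x = (l_x + l_{x+1}) / 2 and hence l_{x+1} = 2 * L_x - l_x.
    """
    lx = [l0, l1]
    for L in L_after_first_year:
        lx.append(2 * L - lx[-1])
    return lx


births = 1238             # annual births stated by Halley
first_year_deaths = 348   # annual deaths under one year of age
first_half_deaths = 238   # of these, assigned to the first six months

l0 = births
l1 = births - first_year_deaths
L0 = births - first_half_deaths   # Halley's first entry, as explained in the text

# 855 and 798 are the entries against 'age current' 2 and 3, i.e. L_1 and L_2.
lx = lx_from_Lx(l0, l1, [855, 798])
per_thousand = [round(1000 * v / l0) for v in lx]

print(L0, l1)         # 1000 890
print(lx)             # [1238, 890, 820, 776]
print(per_thousand)   # [1000, 719, 662, 627]
```

The second entry of `per_thousand` is the 719 used in the comparison above.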

Actually this is still slightly optimistic, because I am comparing ‘persons’ 
with males. The ‘persons’ figure for 1901-10 is 1000, 869. On the other hand, 
in the Breslau data still births (or some of them) are included in births, so that 
the mortality is slightly exaggerated. If for instance 7% are still born, the 
survivors to 1 will be the same, but the l_0 should be reduced to 930. Or
alternatively we should write 1000 and 773.
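
The still-birth adjustment is a single rescaling, sketched here with the illustrative 7% of the text. The function name is hypothetical.

```python
def exclude_stillbirths(l0, l1, stillborn_fraction):
    """Rescale a two-entry table after removing still births from l_0.

    Survivors to age one are unchanged; only the starting number shrinks.
    Returns the reduced l_0 and the survivors to age one per 1000 live births.
    """
    live_l0 = l0 * (1 - stillborn_fraction)
    return round(live_l0), round(1000 * l1 / live_l0)


print(exclude_stillbirths(1000, 719, 0.07))   # (930, 773)
```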

I attach Halley’s table reduced to modern form and with the corresponding 
expectations of life calculated by Böckh (I have reworked some of the values
from the data and agree with Böckh’s figures).


Halley’s Table, expressed in modern form together with the Expectations of Life
at quinquennial intervals (Böckh)

Age     l_x      e_x      Age     l_x      e_x
  0   10,000    27.54      40    3,557    22.05
  5    5,816    41.47      45    3,167    19.47
 10    5,307    40.25      50    2,751    17.05
 15    5,049    37.19      55    2,319    14.75
 20    4,806    33.93      60    1,914    12.33
 25    4,552    30.69      65    1,511     9.96
 30    4,257    27.64      70    1,108     7.74
 35    3,921    24.78      75      670     7.50





V. GUESSING THE POPULATION 


My object is to trace the growth in our country of that part of statistical 
science which is of interest to students of medicine or public health. In speaking 
of such pioneers as Graunt, Petty and Halley it was proper to construe the 
obligation rather freely. Both Graunt and Petty did clearly perceive the relevance
of their researches to matters of public health or even clinical medicine, but much
of what Petty did had a more direct bearing upon political questions than those 
of public health. Again, the life table is a way of expressing the facts of mortality 
which is valuable in some medical researches, but its importance as a statistical 
instrument has been much greater in non-medical than medical circles, above 
all of course in the financing of assurance business. The commercial importance 
of life tables was perceived by Halley and by other mathematicians of his and 
the following generations. 

Looking at the position after Halley’s publication it was clear that progress 
might be made (1) in improving the accuracy of the life table, viz. by obtaining 
data more relevant to the conditions of life of persons who assured their lives 
or bought annuities, (2) in simplifying the very laborious calculations which the
determination of praemia or purchase values required. Under (1) no progress
worth speaking of was made in England until the end of the eighteenth century. 
This was partly due, as we shall see, to a not entirely unjustified disbelief in the 
powers of the medical profession to change the rate of mortality, partly to 
ignorance. No first-rate English mathematician after Halley gave any critical 
attention to the theory of the Life Table before the nineteenth century. Under 
(2) considerable progress was made, but this progress is of little or no medical 
interest and to describe it would involve entering upon tedious arithmetical and 
algebraical detail. The primary medical-statistical quaesita are correct enumera- 
tions of deaths by sex at ages and by causes, and of the numbers living in sex 
and age groups. When these have been satisfied, the medical statistician can get 
to work. 

For 150 years after Graunt’s death very little was done to improve matters. 
Down to 1801 the population as a whole had not been counted; forty years more 
passed before a reasonable age distribution was secured, and it was thirty-eight 
years after the first denominator (populations) that the first numerator (deaths) 
of the fundamental fractions was obtained. Until 1801 intelligent guessing was 
the method and the guesses of the eighteenth century deserve a few pages, if 
only because they prove that statistical ability is as rare as other kinds of ability 
and that wishful thinking is not a modern foible. 

The first estimator to mention belongs to the seventeenth century and was 
a younger contemporary of Graunt and Petty, Gregory King (1648-1712). He 
was born in Lichfield, the son of a land surveyor. At the age of fourteen he was 
recommended as a clerk to the famous herald Dugdale with whom he worked 
for several years; after Dugdale had finished his Visitation, King worked for 
various amateur antiquaries and was eventually invited by a lady of property 
in Sandon (Staffordshire) to be her steward, auditor and secretary. Here he 
remained until 1672 when he moved to London and, no doubt through Dugdale’s 
recommendation, had a considerable amount of employment in both heraldic 
work and ordinary surveying. In 1677 he became a member of the College of 
Arms, in which he attained the rank of Lancaster Herald and so continued until 
his death, but worked for other official bodies on financial subjects. 

The decorous memoir by George Chalmers, from which I have extracted 
these particulars, does not give us a very life-like picture of the man himself. 
There is a certain likeness between the early careers of Petty and King. King 
was not indeed shipped as a cabin boy, but Mr King (the elder) drank (if we may 
venture so coarse an abbreviation of Chalmers’s statement that the father studied 
and practised his profession ‘with more attention to good fellowship than 
mathematical studies generally allow’) and King junior was a pupil teacher at 
eleven. If he really read Hesiod and Homer, made Greek verses and taught 
himself to survey land in his thirteenth year he must have had Petty’s precocity. 
Both Petty and King had experience of practical surveying and, of course, both
were interested in political arithmetic. But there the parallel ends. King was
a professional surveyor and archivist and had a reasonably successful professional 
career. Petty was—Petty. One might, perhaps, adduce as another parallel that 
King made some enemies and thought himself ill-used. But the job by which 
Sir John Vanbrugh, a stranger to the College of Arms, was made a king-at-arms 
over the head of an official of twenty years’ standing would have galled the 
meekest of mankind. One may safely conclude that King had more knowledge 
of the data of political arithmetic than Petty and less originality. His vital 
statistical work was not published until nearly a century after his death, as an 
appendix to the second edition of An Estimate of the Comparative Strength of 
Great Britain by George Chalmers, London, 1803. Perhaps he never intended 
to publish it—he communicated the substance to his contemporary Davenant— 
and this may explain why there are no details of how some of his results were 
reached. The report reads rather like a document prepared for official use by 
persons interested in results not methods. 

The starting-point of King’s attempt to estimate the population was a return 
from the Hearth Office of the number of houses assessed to tax on Lady Day, 
1690. That was 1,319,215 which, King estimated, had increased to 1,326,000 by 
1695. He deducted 30,000 for empty divided houses,* took the round figure of 
1,300,000 and assigned 105,000 to the London area, 195,000 to other cities and 
market towns and 1,000,000 to villages and hamlets. He used a series of
multipliers, 5.4 for a house within the walls of London, 4.6 for a house within the
liberties, 4.4 for the out parishes in Surrey and Middlesex and 4.3 for Westminster.
For other towns, his multiplier was 4.3 and for villages 4.0.

Having performed his multiplications he gives London a bonus of 10%, 
other towns 2% and villages 1%. Lastly he estimates homeless people to 
number 80,000. The final result to the nearest round number is 5½ millions.
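
King’s arithmetic can be retraced in outline. The passage does not say how the 105,000 London houses were divided among the four London multipliers, so the sketch below does not guess at that division; it only brackets the total by applying the lowest (4.3) and the highest (5.4) London multiplier to all of them. Function names are illustrative.

```python
HOMELESS = 80_000


def inhabitants(houses, persons_per_house_tenths, bonus_percent):
    """Houses times persons per house, raised by the percentage bonus.

    The multiplier is given in tenths (43 means 4.3) so that the arithmetic
    stays in whole numbers.
    """
    return houses * persons_per_house_tenths * (100 + bonus_percent) // 1000


def total(london_multiplier_tenths):
    """Total population for one assumed London multiplier (in tenths)."""
    london = inhabitants(105_000, london_multiplier_tenths, 10)
    towns = inhabitants(195_000, 43, 2)
    villages = inhabitants(1_000_000, 40, 1)
    return london + towns + villages + HOMELESS


low = total(43)    # every London house at the Westminster figure of 4.3
high = total(54)   # every London house at the within-the-walls figure of 5.4

print(low, high)   # 5471920 5598970
```

Both bounds round to about 5½ millions, so the stated result is consistent with the multipliers and bonuses given, whatever the exact division of the London houses.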

How King obtained his multiplier is not clear. In addition to the Hearth 
Office data he says he used ‘the assessments on marriages, births, and burials, 
parish registers and other public accounts’ and that from these he deduced the 
multipliers, but this is rather vague. He also classified the population by sex, 
civil state and age (under 1, under 5, under 10, under 16, above 16, above 21, 
above 25, above 60). How he reached these figures is not explained. 

But nothing succeeds like success. As we shall see, his estimate of the total
population is probably very near the truth and Prof. Westergaard has remarked
that, judging from Swedish observations of a few years later, King’s age
distribution is quite reasonable.

* Prof. E. C. K. Gonner (J.R. Statist. Soc. 76 (1912-13), 261-97), in an interesting paper which
I have largely used in writing this chapter, remarks that the ‘houses’ of the Hearth Office must
have been really families or separate occupations as King indeed realized, and thinks that King
fell into some confusion in attempting to replace families by houses. Gonner argued that the best
way was to proceed on the basis that the Hearth Office unit of a family should be retained and
be corrected for empty houses, blacksmiths’ shops, etc. on the basis of 1801 census returns and the
multiplier used should be persons per family of 1801. The result is to give a figure about a quarter
of a million larger than King’s. The method described in the text also leads to the conclusion that
King somewhat understated the population.

As a statistical prophet King was no more successful than his contemporaries 
(and successors). He believed that down to his time the population of England 
had doubled in 435 years, that the next doubling would require from 1200 to 
1300 years and that in A.D. 3500-3600 the population would reach 22 millions
of souls, in case, as he cautiously adds ‘the world should last so long’. His 
estimates as a matter of arithmetical curiosity are excellently fitted by a logistic 
with an upper asymptote of fifteen millions and would give the present popula- 
tion as about eight millions. 

Modern statisticians, such as Farr and Brownlee, have confirmed King’s 
estimate of the population at the end of the seventeenth century in the following 
way. After 1801 the population was known by actual counting and for the first 
forty years of the nineteenth century baptisms and burials were still the only 
data of births and deaths. If one started from, say, the enumeration of 1831 
and worked back to the population of 1821 by adding the numbers of burials 
and subtracting the number of baptisms then, if these really measured deaths 
and births, the result ought to agree with the census enumeration, provided 
immigration and emigration balanced. But the burials and baptisms under- 
stated deaths and births. One might adjust the figures by multipliers to bring 
the result into agreement with the census and then test against another backward 
run of ten years. Brownlee found that if the number of burials were multiplied 
by 1.2 and the number of baptisms by 1.243, the agreement was good.
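
The backward step may be sketched as below. The multipliers 1.2 and 1.243 are those attributed to Brownlee above; the population, burials and baptisms are invented round numbers chosen only to show the arithmetic, and the function name is hypothetical.

```python
def back_project(later_population, burials, baptisms,
                 burial_factor=1.2, baptism_factor=1.243):
    """Estimate the population at the start of an interval from that at its end.

    Deaths in the interval are added back and births taken away, on the
    assumption that immigration and emigration balance. The default factors
    are the multipliers the text attributes to Brownlee.
    """
    deaths = burial_factor * burials
    births = baptism_factor * baptisms
    return later_population + deaths - births


# Invented round numbers, purely to show the arithmetic.
later, burials, baptisms = 1_000_000, 200_000, 300_000

unadjusted = back_project(later, burials, baptisms, 1.0, 1.0)
adjusted = back_project(later, burials, baptisms)

print(round(unadjusted))   # 900000
print(round(adjusted))     # 867100
```

Because the baptism multiplier is the larger of the two, the adjustment lowers the estimate for the earlier date in a growing population.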

This may seem a highly conjectural method, but it certainly gives quite good 
results. The difference between births and deaths estimated in this way for the 
decennium 1801-10, I find to be about 12.4 per 1000 living. If one multiplies 
the enumerated population of 1801 by $(1.0124)^{10}$ we reach 10.1 millions, not a 
bad approximation to 10.2 millions actually counted. Assuming that before 
1801 burials and baptisms had the same relation to deaths and births as between 
1801 and 1841, we can work backwards to the beginning of the eighteenth 
century with the result that the population then was about 5.8 millions, not 
much more than King’s estimate. In view of the following discussion it will be 
useful to consider the probable state of the population (as determined by these 
methods) in the eighteenth century. In the first sixty years of the century it 
grew very slowly, was about 6.1 millions in 1751 and 6.5 millions in 1761. It 
then began to increase faster, was 7.5 in 1781, 8.2 by 1791 and 9.2 at the census 
of 1801 (8.9 as enumerated, but an estimate of a deficit of 1/30th was made). 
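
The compound-growth check in this paragraph can be reproduced in a few lines. This is only an illustrative sketch: it assumes the starting figure is the 8.9 millions enumerated in 1801 and that the excess of births over deaths, 12.4 per 1000, applies uniformly for ten years; the variable names are hypothetical.

```python
# Figures quoted in the text; the uniform annual rate is an assumption.
enumerated_1801 = 8.9          # millions, as enumerated at the 1801 census
annual_excess = 12.4 / 1000    # births minus deaths, per head per year

# Ten years of compound growth at that rate.
projected_1811 = enumerated_1801 * (1 + annual_excess) ** 10
print(round(projected_1811, 1))
```

The projection lands close to the 10.2 millions counted at the next census, which is the agreement the text describes.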

From Gregory King’s time to the census of 1801 we have a series of more or 
less intelligent guesses. 

These are well described in Prof. Gonner’s paper.* Two schools of thought 


* J.R. Statist. Soc. 76 (1912-13), 261-96. 




did battle in the eighteenth century ; the pessimists who held that the population 
was decreasing and the country going steadily to the dogs, and the optimists 
who believed just the contrary. Both used the same weapons. The heavy artillery 
was a return of houses for taxation purposes increased conjecturally by a figure 
for houses which escaped taxation, the sum multiplied by a conjectural average 
of persons per house. As light artillery one had the yield of taxes on commodities 
and the returns of baptisms and burials. 

A pessimist put the number of untaxed houses low and the multiplier low, 
and an optimist raised both. 

The first controversy which took place in 1754 in the proceedings of the Royal 
Society did not attract much notice. Brackenridge (mildly pessimistic) pointed 
out that the number of houses assessed to house tax had decreased between 1710 
and 1754 from 729,048 to 690,000, which suggested a decrease of population 
(by a previous conjectural calculation based on burials and baptisms, he had 
reckoned a small increase, which was probably correct). Much turned on the 
number of houses which did not pay tax (either because the occupant was in 
receipt of alms, did not, owing to poverty, contribute to the church or poor rate, 
or through mere default). Brackenridge put the number at 200,000. His critic, 
Forster, argued that Brackenridge under-stated the number of untaxed houses, 
adducing a sample of nine country parishes with 588 houses of which only 177 
were taxed and a market town with 229 taxed houses out of 448. Using these 
figures as a basis for conjecture Forster raises Brackenridge’s 890,000 to 1,427,110. 
From this (with a multiplier of 6 for town houses and 5 for country houses) he 
reaches a population of seven and a half millions—probably a considerable over- 
estimate. 

The next controversy was a quarter of a century later (in a period when the 
population was certainly increasing) and its originator was Dr Richard Price 
(1723-91), who has attained a posthumous celebrity reminiscent of the man 
whose title to distinction was that he had once been kicked by George IV. Most 
readers know him as the preacher of a sermon which was the text of Burke’s 
Reflections, most students of economic history know him as the inventor of that 
theory of the virtue of a Sinking Fund which has been likened to the economic 
system of a community which prospered by taking in one another's washing; 
most vital statisticians remember him as the computor of the Northampton 
Life Table which gave a seriously incorrect picture of prevailing mortality and in- 
directly cost the country a large sum of money. Finally, in the controversy about 
to be described, Price was pertinaciously in the wrong on all the main issues. 

The apparent inference from all this is that Price was either a fool or a knave. 
Gainsborough’s portrait of the Rev. Richard Price, which hangs (or did hang) 
in the Board Room of the Equitable Assurance Society, gives no support to the 
hypothesis that Price was a fool; his life would be a promising field of research 
for a young historian with a competent knowledge of economics. His importance 










in statistical history is not great enough to justify me in a critical study (even 
if I had the necessary training in finance and economics). My guess is that Price 
was an able, self-confident, original-minded man, who knew a good deal about 
many things and had no exact knowledge of anything. He had ‘a way’ with 
him, he could interest people. In fact he had some of the qualities of Petty. It is 
easy enough to make jokes about his notion of the mysterious power of money 
to increase at compound interest and it is possible that William Pitt the younger 
(who was only a boy when he adopted Price’s theory) was not a good economic 
reasoner. Still, even 150 years ago, there were bankers and Treasury officials, 
and it is possible that both they (and Price) were not so much bad theoretical 
reasoners as shrewd opportunists, that they were deliberately blind to the 
speciousness of an attractive defence of a desirable financial expedient. I have 
myself sometimes wondered whether, in the eighteenth century, an Assurance 
Society would have minded very much if a Life Table had erred on the pessimistic 
side. 

Price did not enter on the population question with an unbiased mind. He 
was a keen politician and he believed that the policy of the government was bad 
for the country; he also believed that the wealth of a country was its people. 
Hence he believed that the population was declining and nothing shook that 
belief. Had he survived another ten years, until the first census, he would 
probably have disputed the accuracy of the returns. 

Price began with the figures of houses in 1690, which he cited from Davenant 
(they were really due to King, who communicated them to Davenant), making 
the total 1,319,000. He then gave the figures of assessed, chargeable and cottages 
(cottages being houses too small to be taxed) as 678,915, 25,628 and 276,149, 
making a total of 980,692 in 1761. In 1777 they were 682,077, 19,396 and 
251,261, a total of 952,734. On this basis he concluded that the population had 
declined by about one and a half millions and was actually less than five millions. 

Howlett and Wales, Price’s chief opponents, impugned every step in the 
reasoning. First, they pointed out that in the estimate for 1690 there was almost 
certainly a confusion between families and houses. Then they argued that many 
householders evaded duty (for instance by the simple plan of blocking up 
windows; the prayer ‘Lighten our darkness, we beseech thee, Oh Pitt’ is still 
remembered) and showed by direct enumeration in certain parishes that the 
returns were inaccurate. Finally, they gave reason to think that Price’s multi- 
plier was too small. On each of these points they were probably right. Indeed 
Price was obliged to admit the validity of some of their criticisms. But he 
declined to budge; sometimes he took ad captandum advantage of arithmetical 
slips by his adversaries, sometimes he declined to admit that their samples were 
representative, sometimes he tried to ignore the effect of corrections which he 
was forced to make. 


These were the principal arguments. Both parties used the data of burials 




and baptisms as subsidiary arguments. Price seems only to have used the London 
Bills, which rather let him down; because although they seemed to help for some 
part of the century, he admits that by 1773 London was increasing and, very 
characteristically, uses this as in his favour: ‘But it appears that, in truth, this 
is an event more to be dreaded than desired. The more London increases, the 
more the rest of the country must be deserted.’ Price’s adversaries went farther 
afield and counted burials and christenings in 162 parishes in all parts of England 
for two quinquennia, one beginning in 1758, the other in 1773. Baptisms increased 
from 47,638 to 59,567, burials from 49,553 to 53,030. 

But neither party put much weight upon what we should now consider 
primary evidence; rightly, because of its incompleteness. 

But these data were not wholly neglected by medical writers as we shall see 
in later sections. One may fairly say on the evidence here summarized that the 
eighteenth-century political arithmeticians of England made no advance what- 
ever upon the position reached by Graunt, Petty and King. They were second- 
rate imitators of men of genius. 


(To be concluded) 








MOMENTS OF THE DISTRIBUTIONS OF POWERS AND 
PRODUCTS OF NORMAL VARIATES 


By J. B. S. HALDANE, F.R.S. 


CONTENTS 
                                                                          PAGE 
 1. Introductory                                                           226 
 2. Distribution of the cube of a normal variate                           228 
 3. Distribution of any power of a normal variate                          229 
 4. Distribution of the product of three independent normal variates       230 
 5. Distribution of the product of n independent normal variates           231 
 6. Distribution of the product of two correlated normal variates          232 
 7. Distribution of the product of three correlated normal variates        235 
 8. The product of n correlated normal variates                            238 
 9. The Galton-MacAlister distribution                                     239 
10. Biological applications                                                240 
11. Summary                                                                241 
    References                                                             242 


1. INTRODUCTORY 


WHEN numerous organisms or organs are weighed, the distribution of the weights 
is often positively skew. On the other hand, the distribution of linear 
measurements is often very close to normal. The question then arises, given that the 
volume of an organ is proportional to the product of three mutually perpen- 
dicular measurements, each being normally distributed, what will be the dis- 
tribution of the volumes? In general the three linear measurements will be 
correlated, and the problem might appear hopelessly complex. However, it will 
be shown later that, provided certain conditions are fulfilled, the distributions 
all lie very close together. 

The problem can obviously be generalized to cover the case of the product of 
any number of normal variates. The most interesting cases are those of two, 
three, or an infinite number of such variates. Further, two special cases are com- 
paratively simple. When the coefficients of correlation are all equal to unity and 
the coefficients of variation equal we are concerned with the distribution of a 
power of the normal variate. When the correlations all vanish, we are concerned 
with that of a product of several uncorrelated normal variates. 

It is not, of course, suggested that all skew variation of weights is to be 
explained on these lines. For example the Galton-Macalister distribution, in 
which the logarithm of the variate is normally distributed, can be thought of as 
arising in at least two different ways. The weight may be the product of a large 
number of normal variates ; or for constant cell size, the number of cell generations 







may be distributed in a certain manner about a mean in each organ, these numbers 
being normally distributed. The highly skew variation of human weights is 
probably to be explained by the fact that a rather small fraction of the human 
race lays down very large quantities of fat. Nevertheless, it will be shown that 
simple criteria will determine whether observed positive skewness and lepto- 
kurtosis, too large to be ascribed to sampling error, can be explained on the lines 
discussed above. And in any particular case it is worth while finding out whether 
this is so. 

Any measure of asymmetry, such as $\gamma_1 = \sqrt{\beta_1}$, or of kurtosis, such as $\gamma_2 = \beta_2 - 3$, 
is a dimensionless number independent of the unit of measurement. Hence in a 
transformed normal distribution of the type here considered it must clearly be 
a function of the only dimensionless number derivable from the first two moments, 
namely, the coefficient of variation, $c$. In what follows we shall generally use $m$ 
for the mean of the original normal distribution and $km^2$ for its variance, so that 
$k^{1/2}$ is the coefficient of variation. The usual notation is used for the mean, variance 
and other moments and cumulants of the derived distribution. 

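The distribution of the product of a pair of correlated normal variates has 
already been fully discussed by Craig (1936). If $m_1$ and $m_2$ are the mean values 
of $X$ and $Y$, $k_1^{1/2}$ and $k_2^{1/2}$ their coefficients of variation, and $\rho$ their coefficient of 
correlation, then Craig finds for the cumulant generating function of 
$\dfrac{XY}{m_1 m_2 \sqrt{k_1 k_2}}$ 

\[
K(\theta) = \frac{\dfrac{2\theta}{\sqrt{k_1 k_2}} + \left(\dfrac{1}{k_1} + \dfrac{1}{k_2} - \dfrac{2\rho}{\sqrt{k_1 k_2}}\right)\theta^2}{2[1-(1+\rho)\theta][1+(1-\rho)\theta]}
 - \tfrac12 \log[1-(1+\rho)\theta] - \tfrac12 \log[1+(1-\rho)\theta].
\]

If $k_1 = k_2 = k$, this becomes 

\[
K(\theta) = \frac{\theta}{k[1-(1+\rho)\theta]} - \tfrac12 \log[1-(1+\rho)\theta] - \tfrac12 \log[1+(1-\rho)\theta].
\]

The $r$th cumulant of $\dfrac{XY}{m_1 m_2 \sqrt{k_1 k_2}}$ is 

\[
\kappa_r = \tfrac14 r!\left[\{(1+\rho)^{r-1} - (\rho-1)^{r-1}\}\left(\frac{1}{k_1} + \frac{1}{k_2} - \frac{2\rho}{\sqrt{k_1 k_2}}\right)
 + \frac{2\{(1+\rho)^{r} - (\rho-1)^{r}\}}{\sqrt{k_1 k_2}}\right]
 + \tfrac12 (r-1)!\,\{(1+\rho)^r + (\rho-1)^r\}.
\]

If $k_1 = k_2 = k$, the $r$th cumulant of $XY$ is 

\[
\kappa_r = (m_1 m_2)^r k^{r-1}\left[(1+\rho)^{r-1}\, r! + \tfrac12 k\{(1+\rho)^r + (\rho-1)^r\}(r-1)!\right].
\]

Craig’s discussion is mainly confined to the cases where $k_1$ and $k_2$ are large. In 
the cases of greatest biometrical interest they are small, and some points of 
interest arise, besides those dealt with by Craig. 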
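
As a hedged illustration of the equal-variation case ($k_1 = k_2 = k$), the closed form for the $r$th cumulant of $XY$ can be evaluated directly. The function name and argument order below are hypothetical, and the sketch takes the formula exactly as written above rather than rederiving it from Craig's paper.

```python
from math import factorial

def craig_cumulant(r, k, rho, m1=1.0, m2=1.0):
    """r-th cumulant of XY when both variates have squared
    coefficient of variation k and correlation rho (formula as quoted)."""
    lead = factorial(r) * (1 + rho) ** (r - 1)
    corr = 0.5 * k * factorial(r - 1) * ((1 + rho) ** r + (rho - 1) ** r)
    return (m1 * m2) ** r * k ** (r - 1) * (lead + corr)

k = 0.01
# rho = 1 gives the square of a single normal variate,
# rho = 0 the product of two independent variates.
second_square = craig_cumulant(2, k, 1.0)
third_independent = craig_cumulant(3, k, 0.0)
```

Setting `rho` to 1 or 0 recovers the two special cases treated later in the paper (the squared variate and the independent product), which makes a convenient consistency check.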










2. DISTRIBUTION OF THE CUBE OF A NORMAL VARIATE 


If $x$ be a reduced normal variate, that is to say, a variate whose mean is zero 
and standard deviation unity, it is required to find the distribution of $X^3$, where 
$X = m(1+k^{1/2}x)$. We note that 

\[
\overline{x^{2r+1}} = 0, \quad\text{and}\quad \overline{x^{2r}} = (2r-1)(2r-3)(2r-5)\dots 3\cdot 1 = \frac{(2r)!}{2^r\, r!}.
\]

Hence 

\[
\begin{aligned}
\overline{X^3} &= m^3\,\overline{(1 + 3k^{1/2}x + 3kx^2 + k^{3/2}x^3)} = m^3(1+3k),\\
\overline{X^6} &= m^6\,\overline{(1 + 6k^{1/2}x + 15kx^2 + 20k^{3/2}x^3 + 15k^2x^4 + 6k^{5/2}x^5 + k^3x^6)}\\
 &= m^6(1 + 15k + 45k^2 + 15k^3),
\end{aligned}
\]

so that the first four moments of $X^3$ about zero are 

\[
\begin{aligned}
\mu'_1 &= m^3(1 + 3k),\\
\mu'_2 &= m^6(1 + 15k + 45k^2 + 15k^3),\\
\mu'_3 &= m^9(1 + 36k + 378k^2 + 1260k^3 + 945k^4),\\
\mu'_4 &= m^{12}(1 + 66k + 1485k^2 + 13860k^3 + 51975k^4 + 62370k^5 + 10395k^6).
\end{aligned}
\]

Hence the moments about the mean $m^3(1 + 3k)$, and the cumulants, are 

\[
\begin{aligned}
\kappa_1 &= \mu'_1 = m^3(1 + 3k),\\
\kappa_2 &= \mu_2 = 3m^6k(3 + 12k + 5k^2),\\
\kappa_3 &= \mu_3 = 54m^9k^2(3 + 16k + 15k^2),\\
\mu_4 &= 27m^{12}k^2(9 + 240k + 1326k^2 + 1920k^3 + 385k^4),\\
\kappa_4 &= 648m^{12}k^3(7 + 48k + 75k^2 + 15k^3).
\end{aligned}
\tag{1}
\]
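
The algebra leading to equations (1) can be checked mechanically. The sketch below is an illustration rather than the author's procedure: it takes $m = 1$, builds the raw moments of $(1 + k^{1/2}x)^3$ from the even moments of a reduced normal variate, and converts them to cumulants in exact rational arithmetic. All function names are hypothetical.

```python
from fractions import Fraction
from math import comb

def normal_moment(j):
    """E[x**j] for a reduced normal x: 0 if j is odd, (j-1)(j-3)...3.1 if even."""
    if j % 2:
        return 0
    out = 1
    for i in range(1, j, 2):
        out *= i
    return out

def raw_moment(power, k):
    """E[(1 + sqrt(k) x)**power]; only even powers of x survive, so sqrt(k) never appears."""
    return sum(comb(power, 2 * s) * normal_moment(2 * s) * k ** s
               for s in range(power // 2 + 1))

def cube_cumulants(k):
    """First four cumulants of X**3 with m = 1."""
    m1, m2, m3, m4 = (raw_moment(3 * r, k) for r in (1, 2, 3, 4))
    mu2 = m2 - m1 ** 2
    mu3 = m3 - 3 * m2 * m1 + 2 * m1 ** 3
    mu4 = m4 - 4 * m3 * m1 + 6 * m2 * m1 ** 2 - 3 * m1 ** 4
    return m1, mu2, mu3, mu4 - 3 * mu2 ** 2

k = Fraction(1, 1000)
kappa1, kappa2, kappa3, kappa4 = cube_cumulants(k)
```

Using `Fraction` keeps every coefficient exact, so the comparison with the printed polynomials is an identity check rather than a floating-point approximation.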


It is to be noted that in these calculations, and in the majority, though not 
all, of those in this paper, the expressions for the moments about the mean are 
simpler than those for the moments about zero. Successive moments about the 
mean are therefore best calculated from the former, so far as possible, rather 
than the latter. Thus the equation 

\[
\mu_4 = \mu'_4 - \mu'_1\left(4\mu_3 + 6\mu'_1\mu_2 + \mu'^{\,3}_1\right)
\]

involves less algebra than the more usual 

\[
\mu_4 = \mu'_4 - 4\mu'_1\mu'_3 + 6\mu'^{\,2}_1\mu'_2 - 3\mu'^{\,4}_1.
\]

It follows that 

\[
\begin{aligned}
\sigma &= 3m^3k^{1/2}\left(1 + 4k + \tfrac53 k^2\right)^{1/2},\\
c &= \frac{(9k + 36k^2 + 15k^3)^{1/2}}{1+3k},\\
\gamma_1 &= \frac{6(3k)^{1/2}(3 + 16k + 15k^2)}{(3 + 12k + 5k^2)^{3/2}},\\
\gamma_2 &= \frac{72k(7 + 48k + 75k^2 + 15k^3)}{(3 + 12k + 5k^2)^2}.
\end{aligned}
\]







Hence the distribution is positively skew, and leptokurtic. If $c$ is small, we 
have, approximately, 

\[
\begin{aligned}
\gamma_1 &= 2c + \frac{2c^3}{27} + \dots,\\
\gamma_2 &= \frac{56c^2}{9} + \frac{16c^4}{27} + \dots.
\end{aligned}
\tag{2}
\]

Since in most practical cases $c^2 < 0.01$, the first term will have an error of less 
than 0.1 %, and no attention need be paid to the later terms. 
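
A quick numerical comparison shows how good the one-term approximations are. This sketch assumes $k = 0.001$ as a representative value and uses the exact expressions for $c$, $\gamma_1$ and $\gamma_2$ of the cubed variate given in this section; the function name is hypothetical.

```python
from math import sqrt

def cube_shape(k):
    """Exact c, gamma_1, gamma_2 for the cube of a normal variate."""
    c = sqrt(9 * k + 36 * k ** 2 + 15 * k ** 3) / (1 + 3 * k)
    g1 = 6 * sqrt(3 * k) * (3 + 16 * k + 15 * k ** 2) / (3 + 12 * k + 5 * k ** 2) ** 1.5
    g2 = 72 * k * (7 + 48 * k + 75 * k ** 2 + 15 * k ** 3) / (3 + 12 * k + 5 * k ** 2) ** 2
    return c, g1, g2

c, g1, g2 = cube_shape(0.001)
rel_err_g1 = abs(g1 - 2 * c) / g1              # error of the leading term 2c
rel_err_g2 = abs(g2 - 56 * c ** 2 / 9) / g2    # error of the leading term 56c^2/9
```

Both relative errors come out below one part in a thousand when $c^2 < 0.01$, in line with the statement above.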


3. DISTRIBUTION OF ANY POWER OF A NORMAL VARIATE 


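It is required to find the moments of $m^n(1+k^{1/2}x)^n$. The mean value of its $r$th 
power is clearly 

\[
\mu'_r = m^{nr}\left[1 + \frac{(nr)!\,k}{2\,(nr-2)!} + \frac{(nr)!\,k^2}{8\,(nr-4)!} + \dots + \frac{(nr)!\,k^s}{2^s\, s!\,(nr-2s)!} + \dots\right].
\]

This converges for all positive values of $n$, since $k$ is small, though it only 
terminates for positive integral values of $nr$. After somewhat tedious algebra, 
we find for the mean and other moments and cumulants: 

\[
\begin{aligned}
\kappa_1 &= \mu'_1 = m^n\left[1 + \tfrac12 n(n-1)k + \dots\right],\\
\kappa_2 &= \mu_2 = m^{2n}n^2k\left[1 + \tfrac12 (n-1)(3n-5)k + \dots\right],\\
\kappa_3 &= \mu_3 = 3m^{3n}n^3(n-1)k^2\left[1 + \tfrac16 (17n^2 - 55n + 44)k + \dots\right],\\
\mu_4 &= 3m^{4n}n^4k^2\left[1 + \tfrac53 (n-1)(5n-7)k + \dots\right],\\
\kappa_4 &= 4m^{4n}n^4(n-1)(4n-5)k^3\,[1 + O(k)].
\end{aligned}
\tag{3}
\]

Hence 

\[
\begin{aligned}
c &= nk^{1/2}\left[1 + \tfrac14 (n-1)(n-5)k + \dots\right],\\
\gamma_1 &= 3(n-1)k^{1/2}\left[1 + \tfrac1{12}(7n^2 - 38n + 43)k + \dots\right],\\
\gamma_2 &= 4(n-1)(4n-5)k\,[1 + O(k)].
\end{aligned}
\]

And when $c$ is small 

\[
\begin{aligned}
\gamma_1 &= \frac{3(n-1)c}{n}\,[1 + O(c^2)],\\
\gamma_2 &= \frac{4(n-1)(4n-5)c^2}{n^2}\,[1 + O(c^2)].
\end{aligned}
\tag{4}
\]

When $n = 2$, the expressions become very simple, 

\[
\begin{aligned}
\kappa_1 &= m^2(1+k),\\
\kappa_2 &= 2m^4k(2+k),\\
\kappa_3 &= 8m^6k^2(3+k),\\
\kappa_r &= (r-1)!\,m^{2r}(2k)^{r-1}(r+k).
\end{aligned}
\tag{5}
\]

This is in accordance with Craig’s formula, putting $k_1 = k_2 = k$, $\rho = 1$. 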













4. DISTRIBUTION OF THE PRODUCT OF THREE INDEPENDENT 
NORMAL VARIATES 
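Let $x$, $y$, $z$ be independent reduced normal variates, so that $\overline{x^2} = \overline{y^2} = \overline{z^2} = 1$, 
$\overline{x^4} = \overline{y^4} = \overline{z^4} = 3$, etc. Let $X = m_1(1+k_1^{1/2}x)$, $Y = m_2(1+k_2^{1/2}y)$, $Z = m_3(1+k_3^{1/2}z)$. 
Here $k_1^{1/2}$, $k_2^{1/2}$, $k_3^{1/2}$ are coefficients of variation of linear measurements, so that $k_1$, $k_2$, $k_3$ 
are commonly about 0.001. It is required to find the moments of the distribution 
of $XYZ$. In the expansion of a power of $XYZ$ we need only consider even powers 
of $x$, $y$ and $z$. For example, 

\[
\begin{aligned}
\overline{X^2Y^2Z^2} &= m_1^2m_2^2m_3^2\,\overline{(1 + k_1^{1/2}x)^2 (1 + k_2^{1/2}y)^2 (1 + k_3^{1/2}z)^2}\\
 &= m_1^2m_2^2m_3^2\,\overline{(1 + k_1x^2)(1 + k_2y^2)(1 + k_3z^2)}\\
 &= m_1^2m_2^2m_3^2(1 + k_1)(1 + k_2)(1 + k_3).
\end{aligned}
\]

So if $m_1m_2m_3 = V$ (a volume), the moments of $XYZ$ about zero are 

\[
\begin{aligned}
\mu'_1 &= V,\\
\mu'_2 &= V^2(1+k_1)(1+k_2)(1+k_3),\\
\mu'_3 &= V^3(1 + 3k_1)(1 + 3k_2)(1 + 3k_3),\\
\mu'_4 &= V^4(1 + 6k_1 + 3k_1^2)(1 + 6k_2 + 3k_2^2)(1 + 6k_3 + 3k_3^2).
\end{aligned}
\]

So 

\[
\begin{aligned}
\kappa_2 &= \mu_2 = V^2(\Sigma k_1 + \Sigma k_1k_2 + k_1k_2k_3),\\
\kappa_3 &= \mu_3 = 6V^3(\Sigma k_1k_2 + 4k_1k_2k_3),\\
\mu_4 &= 3V^4(\Sigma k_1^2 + 2\Sigma k_1k_2 + 6\Sigma k_1^2k_2 + 38k_1k_2k_3 + 3\Sigma k_1^2k_2^2\\
 &\qquad + 36\Sigma k_1^2k_2k_3 + 18\Sigma k_1^2k_2^2k_3 + 9k_1^2k_2^2k_3^2),\\
\kappa_4 &= 6V^4[2\Sigma k_1^2k_2 + \Sigma k_1^2k_2^2 + 4k_1k_2k_3(4 + 4\Sigma k_1 + 2\Sigma k_1k_2 + k_1k_2k_3)].
\end{aligned}
\]

In practice we can neglect all terms except the leading one, and write 

\[
\begin{aligned}
\kappa_1 &= V,\\
\kappa_2 &= V^2(k_1 + k_2 + k_3),\\
\kappa_3 &= 6V^3(k_2k_3 + k_3k_1 + k_1k_2),\\
\kappa_4 &= 12V^4(k_2^2k_3 + k_2k_3^2 + k_3^2k_1 + k_3k_1^2 + k_1^2k_2 + k_1k_2^2 + 8k_1k_2k_3).
\end{aligned}
\tag{6}
\]

If $k_1 = k_2 = k_3 = k$, we find 

\[
\begin{aligned}
\kappa_1 &= V,\\
\kappa_2 &= V^2k(3 + 3k + k^2),\\
\kappa_3 &= 6V^3k^2(3 + 4k),\\
\kappa_4 &= 6V^4k^3(28 + 51k + 24k^2 + 4k^3),
\end{aligned}
\tag{7}
\]

which may be compared with equations (1). Thus 

\[
\begin{aligned}
c &= (3k + 3k^2 + k^3)^{1/2},\\
\gamma_1 &= 2c - \frac{4c^3}{9} + \dots,\\
\gamma_2 &= \frac{56c^2}{9} - \frac{22c^4}{9} + \dots.
\end{aligned}
\tag{8}
\]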




It will be noticed that the leading terms of equations (2) and (8) are identical. 
Hence the two distributions will, in practice, be quite indistinguishable, even if 
tens of thousands of individuals are weighed. It follows that the cube root of the 
product of three independent variates is almost normally distributed. Further, 
the relations of $\gamma_1$ and $\gamma_2$ to $c$ are little altered when $k_1$, $k_2$ and $k_3$ are different, 
provided that they are of the same order of magnitude. Thus if $k_1 = 0.001$, 
$k_2 = 0.002$, $k_3 = 0.003$, we find $\gamma_1 = \dfrac{11c}{6}$ instead of $2c$, and $\gamma_2 = \dfrac{16c^2}{3} = 5.3c^2$ 
instead of $6.2c^2$. Only a very large sample indeed would reveal departures of this 
order from the simpler expressions of equations (2) and (8). 
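
The numerical example with unequal $k_1$, $k_2$, $k_3$ follows from the leading terms of equations (6). The sketch below is illustrative only: it keeps leading terms, takes $c^2 = k_1 + k_2 + k_3$ to that order, and uses hypothetical variable names.

```python
from fractions import Fraction
from itertools import combinations, permutations

ks = [Fraction(1, 1000), Fraction(2, 1000), Fraction(3, 1000)]

s1 = sum(ks)                                          # sum of k_i, equal to c^2 to leading order
s2 = sum(a * b for a, b in combinations(ks, 2))       # sum of k_i k_j over unordered pairs
s21 = sum(a * a * b for a, b in permutations(ks, 2))  # sum of k_i^2 k_j over ordered pairs
k123 = ks[0] * ks[1] * ks[2]

# gamma_1 = kappa_3 / kappa_2^(3/2) and gamma_2 = kappa_4 / kappa_2^2, from equations (6)
gamma1_over_c = 6 * s2 / s1 ** 2
gamma2_over_c2 = 12 * (s21 + 8 * k123) / s1 ** 3
print(gamma1_over_c, gamma2_over_c2)
```

The two ratios are exact fractions, to be set against the values $2$ and $56/9$ that hold when the three coefficients are equal.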


5. DISTRIBUTION OF THE PRODUCT OF n INDEPENDENT NORMAL VARIATES 


Let $x_1, x_2, x_3, \dots, x_r, \dots, x_n$ be $n$ independent reduced normal variates. The 
general case is of course somewhat complicated, so we shall only investigate the 
special case where all the coefficients of variation are equal. 

Then if $X_r = m_r(1+k^{1/2}x_r)$, and $M = \Pi m_r$, the moments about zero of the 
distribution of the product $\Pi X_r$ are 

\[
\begin{aligned}
\mu'_1 &= M,\\
\mu'_2 &= M^2(1+k)^n,\\
\mu'_3 &= M^3(1+3k)^n,\\
\mu'_4 &= M^4(1+6k+3k^2)^n,\\
\mu'_5 &= M^5(1+10k+15k^2)^n,\\
\mu'_6 &= M^6(1+15k+45k^2+15k^3)^n.
\end{aligned}
\]

Hence 

\[
\begin{aligned}
\mu_2 &= M^2[(1+k)^n - 1],\\
\mu_3 &= M^3[(1+3k)^n - 3(1+k)^n + 2],\\
\mu_4 &= M^4[(1+6k+3k^2)^n - 4(1+3k)^n + 6(1+k)^n - 3],\\
\mu_5 &= M^5[(1+10k+15k^2)^n - 5(1+6k+3k^2)^n + 10(1+3k)^n - 10(1+k)^n + 4],\\
\mu_6 &= M^6[(1+15k+45k^2+15k^3)^n - 6(1+10k+15k^2)^n + 15(1+6k+3k^2)^n\\
 &\qquad - 20(1+3k)^n + 15(1+k)^n - 5],\\
\kappa_4 &= M^4\{(1+6k+3k^2)^n - 3(1+2k+k^2)^n - 4(1+3k)^n + 12(1+k)^n - 6\},\\
\kappa_5 &= M^5\{(1+10k+15k^2)^n + 5[-(1+6k+3k^2)^n - 2(1+4k+3k^2)^n\\
 &\qquad + 6(1+2k+k^2)^n + 4(1+3k)^n - 12(1+k)^n] + 24\},\\
\kappa_6 &= M^6\{(1+15k+45k^2+15k^3)^n + 15[2(1+3k+3k^2+k^3)^n\\
 &\qquad - (1+7k+9k^2+3k^3)^n + 2(1+6k+3k^2)^n + 8(1+4k+3k^2)^n\\
 &\qquad - 18(1+2k+k^2)^n - 8(1+3k)^n + 24(1+k)^n - 8]\\
 &\qquad - 6(1+10k+15k^2)^n - 10(1+6k+9k^2)^n\}.
\end{aligned}
\]










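The higher cumulants can be evaluated as follows: 

\[
\begin{aligned}
\kappa_3 &= 3M^3\sum_{r=1}^{\infty}\binom{n}{r}(3^{r-1}-1)\,k^r,\\
\kappa_4 &= 3M^4\sum_{r=1}^{\infty}\binom{n}{r}(3^{r-1}-1)\,[(2+k)^r - 4]\,k^r, \text{ etc.}
\end{aligned}
\]

Thus we find 

\[
\begin{aligned}
\kappa_1 &= M,\\
\kappa_2 &= M^2nk\left[1 + \tfrac12 (n-1)k + \dots\right],\\
\kappa_3 &= M^3n(n-1)k^2[3 + 4(n-2)k + \dots],\\
\kappa_4 &= M^4n(n-1)k^3[4(4n-5) + 3(13n^2 - 49n + 47)k + \dots],\\
\kappa_5 &= 5M^5n(n-1)k^4[(5n-6)(5n-7) + 2(n-2)(143n - 345)k + \dots],\\
\kappa_6 &= 3M^6n(n-1)k^5[432n^3 - 1853n^2 + 2917n - 1758 + O(k)].
\end{aligned}
\tag{9}
\]

These may be compared with equations (3). 

\[
c = (nk)^{1/2}\left[1 + \tfrac14 (n-1)k + \dots\right],
\]

and when $c$ is small 

\[
\begin{aligned}
\gamma_1 &= \frac{3(n-1)c}{n}\,[1 + O(c^2)],\\
\gamma_2 &= \frac{4(n-1)(4n-5)c^2}{n^2}\,[1 + O(c^2)],
\end{aligned}
\]

as in equations (4). 

When $n = 2$ we have the simple forms 

\[
\begin{aligned}
\kappa_1 &= M,\\
\kappa_2 &= M^2k(2+k),\\
\kappa_3 &= 6M^3k^2,\\
\kappa_4 &= 6M^4k^3(4+k),\\
\kappa_{2r} &= (2r-1)!\,M^{2r}k^{2r-1}(2r+k),\\
\kappa_{2r+1} &= (2r+1)!\,M^{2r+1}k^{2r}.
\end{aligned}
\]

This is in accordance with Craig’s formula, putting $k_1 = k_2 = k$, $\rho = 0$. 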
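
The closed forms for the product of $n$ independent variates are easy to confirm from the raw moments. The following sketch takes $M = 1$ and works in exact rationals; the function name is hypothetical, and only the second to fourth cumulants are computed.

```python
from fractions import Fraction

def product_cumulants(n, k):
    """kappa_2, kappa_3, kappa_4 of a product of n independent normal
    variates, each with squared coefficient of variation k (M = 1)."""
    m2 = (1 + k) ** n
    m3 = (1 + 3 * k) ** n
    m4 = (1 + 6 * k + 3 * k ** 2) ** n
    mu2 = m2 - 1
    mu3 = m3 - 3 * m2 + 2
    mu4 = m4 - 4 * m3 + 6 * m2 - 3
    return mu2, mu3, mu4 - 3 * mu2 ** 2

k = Fraction(1, 1000)
pair = product_cumulants(2, k)     # the n = 2 special case
triple = product_cumulants(3, k)   # compare with equations (7)
```

Because the mean is exactly $M$, the central moments need no correction beyond the binomial sums, which is why the function is so short.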


6. DISTRIBUTION OF THE PRODUCT OF TWO CORRELATED NORMAL VARIATES 


In general the different linear dimensions of an organ or organism are posi- 
tively correlated. Organic correlations may reach very high values, such as 0.9, 
and presumably even higher values would be found for two approximately equal 
diameters of an approximate solid of revolution, such as an apple or an egg, where 
p may be taken as unity. On the other hand, quite low values are found. Thus the 




length and breadth of a homogeneous group of like-sexed adult human skulls 
generally show a correlation of about 0.3 or 0.4. Hence the area of an organ will 
commonly be proportional to the product of two correlated variables, the 
volume to the product of three. 


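Let $x$ and $y$ be two reduced normal variates, with correlation $\rho$. The cumulant- 
generating function for their joint distribution is $\tfrac12(t^2 + 2\rho tu + u^2)$. That is to say 
if $r+s$ is odd, then $\overline{x^ry^s} = 0$. If $r+s$ is even, then $\overline{x^ry^s}$ is the coefficient of $\dfrac{t^ru^s}{r!\,s!}$ in 
$\exp \tfrac12(t^2 + 2\rho tu + u^2)$. That is to say, $\overline{x^ry^s}$ is the coefficient of $t^ru^s$ in 
$(t^2 + 2\rho tu + u^2)^{\frac12(r+s)}$, multiplied by $\dfrac{r!\,s!}{2^{\frac12(r+s)}\,[\tfrac12(r+s)]!}$. 

It follows that 

\[
\begin{aligned}
&\overline{x^2} = 1, \quad \overline{xy} = \rho,\\
&\overline{x^4} = 3, \quad \overline{x^3y} = 3\rho, \quad \overline{x^2y^2} = 1 + 2\rho^2,\\
&\overline{x^6} = 15, \quad \overline{x^5y} = 15\rho, \quad \overline{x^4y^2} = 3(1+4\rho^2), \quad \overline{x^3y^3} = 3\rho(3+2\rho^2),\\
&\overline{x^8} = 105, \quad \overline{x^7y} = 105\rho, \quad \overline{x^6y^2} = 15(1+6\rho^2), \quad \overline{x^5y^3} = 15\rho(3+4\rho^2),\\
&\overline{x^4y^4} = 3(3 + 24\rho^2 + 8\rho^4),\\
&\overline{x^8y^2} = 105(1+8\rho^2), \quad \overline{x^7y^3} = 315\rho(1+2\rho^2), \quad \overline{x^6y^4} = 45(1 + 12\rho^2 + 8\rho^4),\\
&\overline{x^8y^4} = 315(1 + 16\rho^2 + 16\rho^4).
\end{aligned}
\]

Of course $\overline{x^ry^s} = \overline{x^sy^r}$. 
Thus the moments of $xy$ about its mean $\rho$ are 

\[
\mu_2 = 1+\rho^2, \quad \mu_3 = 2\rho(3+\rho^2), \quad \mu_4 = 3(3 + 14\rho^2 + 3\rho^4).
\]

Hence $\kappa_4 = 6(1 + 6\rho^2 + \rho^4)$. Hence the distribution is leptokurtic, and 
asymmetrical unless $\rho$ vanishes. 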
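
The table of product moments can also be generated by counting pairings, which is an equivalent way of reading off the coefficients described above. The sketch below is not the method used in the text: it swaps in a direct pairing count, where a term in $\rho^j$ arises from choosing $j$ cross pairs between the $x$'s and the $y$'s and pairing the remainder within each set. Function names are hypothetical.

```python
from math import comb, factorial

def pairings(n):
    """Number of ways to split n items into unordered pairs (0 if n is odd)."""
    if n % 2:
        return 0
    out = 1
    for i in range(1, n, 2):
        out *= i
    return out

def joint_moment(r, s):
    """E[x^r y^s] for reduced normal x, y with correlation rho,
    returned as {power of rho: integer coefficient}."""
    coeffs = {}
    for j in range(min(r, s) + 1):          # j = number of x-y pairs
        ways = (comb(r, j) * comb(s, j) * factorial(j)
                * pairings(r - j) * pairings(s - j))
        if ways:
            coeffs[j] = ways
    return coeffs

print(joint_moment(4, 4))   # coefficients of 1, rho^2, rho^4
```

For instance `joint_moment(3, 3)` gives the coefficients of $3\rho(3 + 2\rho^2)$, namely 9 for $\rho$ and 6 for $\rho^3$.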
Now consider two correlated normal variates 

\[
X = m_1(1+ax), \quad\text{and}\quad Y = m_2(1+by),
\]

where $a$ and $b$ are coefficients of variation. Let $m_1m_2 = A$. Then the moments of 
$XY$ about zero are 

\[
\begin{aligned}
\mu'_1 &= A(1+\rho ab),\\
\mu'_2 &= A^2[1 + a^2 + 4\rho ab + b^2 + (1+2\rho^2)a^2b^2],\\
\mu'_3 &= A^3[1 + 3(a^2 + 3\rho ab + b^2) + 9ab\{\rho a^2 + (1+2\rho^2)ab + \rho b^2\} + 3\rho(3+2\rho^2)a^3b^3],\\
\mu'_4 &= A^4[1 + 2(3a^2 + 8\rho ab + 3b^2) + 3\{a^4 + 16\rho a^3b + 12(1+2\rho^2)a^2b^2 + 16\rho ab^3 + b^4\}\\
 &\qquad + 6a^2b^2\{3(1+4\rho^2)a^2 + 8\rho(3+2\rho^2)ab + 3(1+4\rho^2)b^2\} + 3(3 + 24\rho^2 + 8\rho^4)a^4b^4].
\end{aligned}
\]











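Hence 

\[
\begin{aligned}
\kappa_2 &= \mu_2 = A^2[a^2 + 2\rho ab + b^2 + (1+\rho^2)a^2b^2],\\
\kappa_3 &= \mu_3 = 2A^3ab[3\{\rho a^2 + (1+\rho^2)ab + \rho b^2\} + \rho(3+\rho^2)a^2b^2],\\
\mu_4 &= 3A^4[(a^2 + 2\rho ab + b^2)^2 + 2a^2b^2\{(3+7\rho^2)a^2 + 2\rho(7+3\rho^2)ab\\
 &\qquad + (3+7\rho^2)b^2\} + (3 + 14\rho^2 + 3\rho^4)a^4b^4],\\
\kappa_4 &= 6A^4a^2b^2[2\{(1+3\rho^2)a^2 + 2\rho(3+\rho^2)ab + (1+3\rho^2)b^2\}\\
 &\qquad + (1 + 6\rho^2 + \rho^4)a^2b^2],
\end{aligned}
\tag{10}
\]

in accordance with Craig’s formula. 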
The most interesting case arises when $a = b = k^{1/2}$. This case is important 
because in practice the coefficients of variation of different linear dimensions of 
the same organ are often nearly equal. Thus those of linear skull measurements 
in like-sexed adults in a racially homogeneous population are about 0.03, so that 
$k$ is about 0.001. The moments and cumulants of $XY$ are then 

\[
\begin{aligned}
\kappa_1 &= A(1+\rho k),\\
\kappa_2 &= \mu_2 = A^2k[2(1+\rho) + (1+\rho^2)k],\\
\kappa_3 &= \mu_3 = 2A^3k^2[3(1+\rho)^2 + \rho(3+\rho^2)k],\\
\mu_4 &= 3A^4k^2[4(1+\rho)^2 + 4(3 + 7\rho + 7\rho^2 + 3\rho^3)k + (3 + 14\rho^2 + 3\rho^4)k^2],\\
\kappa_4 &= 6A^4k^3[4(1+\rho)^3 + (1 + 6\rho^2 + \rho^4)k].
\end{aligned}
\tag{11}
\]

Hence 

\[
c = [2(1+\rho)k]^{1/2}\left[1 + \frac{1 - 4\rho - 3\rho^2}{4(1+\rho)}\,k + \dots\right],
\]

while 

\[
\begin{aligned}
\gamma_1 &= \tfrac32 c\,[1 + O(c^2)],\\
\gamma_2 &= 3c^2\,[1 + O(c^2)],
\end{aligned}
\]

exactly as found when the two variates are in a constant ratio or quite indepen- 
dent. When however a + J, this is no longer the case, for it is clear that the distribu- 

: . a2 62 
tion becomes normal when a or 6 vanishes. If ne = p, so that p> 1, we have, 
for equations (10), 
1+2pp+p?3c | 1+ 3pp—3p?+ pp* “2 

os tae (p+p) 
both approximately. However, a and b may differ considerably without any very 
great effect of y,/c or y,/c. Thus if a = 26, so that p = 3, 


i 


8(2+ 5p + 2p?) 3c 
eae 2” 
That is to say, y, is 64 % of 3c/2 -f p = 0, and 86 % if p = }, whilst y, is 51 % of 
3c if p = 0, and 71 % if p = }. 
So far we have assumed that $\rho$ is not negative. If $\rho = -1$, $XY = A(1 - kx^2)$, 
so the mean is $A(1-k)$, and the other cumulants are given by 

\[
\kappa_r = (-1)^r\, 2^{r-1}(r-1)!\,A^rk^r.
\]










If $\rho = -1 + 2\left(\dfrac{k}{3}\right)^{1/2} + \dots$, $\kappa_3$ in equations (11) vanishes, though the distribution 
does not become quite symmetrical. However, negative values of $\rho$ are of no 
biological interest. 


7. DISTRIBUTION OF THE PRODUCT OF THREE CORRELATED NORMAL VARIATES 


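Let $x$, $y$, $z$ be three reduced normal variates as before, their correlations being 
$\rho_{yz} = \kappa$, $\rho_{zx} = \lambda$, $\rho_{xy} = \mu$. Thus the cumulant-generating function is 

\[
\tfrac12(t^2 + u^2 + v^2 + 2\kappa uv + 2\lambda vt + 2\mu tu).
\]

Odd moments vanish, and the even moment $\overline{x^py^qz^r}$ is the coefficient of $t^pu^qv^r$ in 
$(t^2 + u^2 + v^2 + 2\kappa uv + 2\lambda vt + 2\mu tu)^{\frac12(p+q+r)}$, 
multiplied by $\dfrac{p!\,q!\,r!}{2^{\frac12(p+q+r)}\,[\tfrac12(p+q+r)]!}$. 
The required moments of products of two variates, such as 
$\overline{xy} = \mu$, $\overline{x^4y^2} = 3(1+4\mu^2)$, 
have already been given. The required moments of products of all three are: 

\[
\begin{aligned}
\overline{x^2yz} &= \kappa + 2\lambda\mu,\\
\overline{x^4yz} &= 3(\kappa + 4\lambda\mu), \quad \overline{x^3y^2z} = 3(\lambda + 2\kappa\mu + 2\lambda\mu^2), \quad \overline{x^2y^2z^2} = 1 + 2\Sigma\kappa^2 + 8\kappa\lambda\mu,\\
\overline{x^4y^3z} &= 3(3\kappa + 12\lambda\mu + 12\kappa\mu^2 + 8\lambda\mu^3),\\
\overline{x^4y^2z^2} &= 3(1 + 2\kappa^2 + 4\lambda^2 + 4\mu^2 + 16\kappa\lambda\mu + 8\lambda^2\mu^2),\\
\overline{x^3y^3z^2} &= 3(3\mu + 6\kappa\lambda + 2\mu^3 + 6\kappa^2\mu + 6\lambda^2\mu + 12\kappa\lambda\mu^2),\\
\overline{x^4y^4z^2} &= 3(3 + 12\kappa^2 + 12\lambda^2 + 24\mu^2 + 96\kappa\lambda\mu + 48\kappa^2\mu^2 + 48\lambda^2\mu^2 + 8\mu^4 + 64\kappa\lambda\mu^3),\\
\overline{x^4y^3z^3} &= 9(3\kappa + 12\lambda\mu + 2\kappa^3 + 12\kappa\lambda^2 + 12\kappa\mu^2 + 24\kappa^2\lambda\mu\\
 &\qquad + 8\lambda^3\mu + 8\lambda\mu^3 + 24\kappa\lambda^2\mu^2),\\
\overline{x^4y^4z^4} &= 9[3 + 24\Sigma\kappa^2 + 8\Sigma\kappa^4 + 96\Sigma\lambda^2\mu^2 + 64\kappa\lambda\mu(3 + 2\Sigma\kappa^2 + 3\kappa\lambda\mu)].
\end{aligned}
\]

Other moments (except those such as $\overline{xy^2z} = \lambda + 2\kappa\mu$, which are derivable by 
transposition) are not needed for our purposes. It follows that $xyz$ has a 
symmetrical and leptokurtic distribution, with mean zero, and 

\[
\begin{aligned}
\kappa_2 &= \mu_2 = 1 + 2\Sigma\kappa^2 + 8\kappa\lambda\mu,\\
\kappa_4 &= 12[2 + 17\Sigma\kappa^2 + 5\Sigma\kappa^4 + 70\Sigma\lambda^2\mu^2 + 4\kappa\lambda\mu(35 + 22\Sigma\kappa^2 + 32\kappa\lambda\mu)].
\end{aligned}
\]

If $X = m_1(1+ax)$, $Y = m_2(1+by)$, $Z = m_3(1+cz)$ the expressions for the 
higher moments are very complicated, though it can easily be shown that the 
mean is $m_1m_2m_3(1 + \kappa bc + \lambda ca + \mu ab)$, and the variance 

\[
m_1^2m_2^2m_3^2\,[\Sigma a^2 + 2\Sigma\kappa bc + \Sigma(1+\kappa^2)b^2c^2 + 2\Sigma(2\kappa + 3\lambda\mu)a^2bc
 + (1 + 2\Sigma\kappa^2 + 8\kappa\lambda\mu)a^2b^2c^2].
\]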













We shall only give further consideration to the case where all the coefficients 
of variation are equal. And we shall confine ourselves to two special cases of this. 
In the one $\kappa = \lambda = \mu = \rho$, that is to say the variates are equally, and therefore of 
course positively, correlated. This is not very far from the case with the human 
skull. In the other, $\kappa = 1$, whilst $\lambda = \mu = \rho$. This is appropriate to an approximate 
solid of revolution, such as many eggs and fruits, or to a regular prism such as 
some sponge spicules. 

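In the first case we have for the moments of products of three variables: 

\[
\begin{aligned}
\overline{x^2yz} &= \rho(1+2\rho),\\
\overline{x^4yz} &= 3\rho(1+4\rho), \quad \overline{x^3y^2z} = 3\rho(1 + 2\rho + 2\rho^2), \quad \overline{x^2y^2z^2} = 1 + 6\rho^2 + 8\rho^3,\\
\overline{x^4y^3z} &= 3\rho(3 + 12\rho + 12\rho^2 + 8\rho^3), \quad \overline{x^4y^2z^2} = 3(1 + 10\rho^2 + 16\rho^3 + 8\rho^4),\\
\overline{x^3y^3z^2} &= 3\rho(3 + 6\rho + 14\rho^2 + 12\rho^3),\\
\overline{x^4y^4z^2} &= 3(3 + 48\rho^2 + 96\rho^3 + 104\rho^4 + 64\rho^5),\\
\overline{x^4y^3z^3} &= 9\rho(3 + 12\rho + 26\rho^2 + 40\rho^3 + 24\rho^4),\\
\overline{x^4y^4z^4} &= 27(1 + 24\rho^2 + 64\rho^3 + 104\rho^4 + 128\rho^5 + 64\rho^6).
\end{aligned}
\]

So if $X = m_1(1+k^{1/2}x)$, $Y = m_2(1+k^{1/2}y)$, $Z = m_3(1+k^{1/2}z)$, and $m_1m_2m_3 = V$, 
the volume which is the product of the means, then the moments of $XYZ$ about 
zero are 

\[
\begin{aligned}
\mu'_1 &= V(1+3\rho k),\\
\mu'_2 &= V^2[1 + 3(1+4\rho)k + 3(1 + 4\rho + 10\rho^2)k^2 + (1 + 6\rho^2 + 8\rho^3)k^3],\\
\mu'_3 &= V^3[1 + 9(1+3\rho)k + 27(1 + 5\rho + 8\rho^2)k^2 + 9(3 + 21\rho + 54\rho^2\\
 &\qquad + 62\rho^3)k^3 + 27\rho(3 + 6\rho + 14\rho^2 + 12\rho^3)k^4],\\
\mu'_4 &= V^4[1 + 6(3+8\rho)k + 9(13 + 64\rho + 88\rho^2)k^2 + 36(9 + 64\rho + 160\rho^2\\
 &\qquad + 152\rho^3)k^3 + 27(13 + 128\rho + 448\rho^2 + 768\rho^3 + 568\rho^4)k^4\\
 &\qquad + 54(3 + 24\rho + 144\rho^2 + 304\rho^3 + 424\rho^4 + 256\rho^5)k^5\\
 &\qquad + 27(1 + 24\rho^2 + 64\rho^3 + 104\rho^4 + 128\rho^5 + 64\rho^6)k^6].
\end{aligned}
\]

Hence 

\[
\begin{aligned}
\kappa_1 &= \mu'_1 = V(1+3\rho k),\\
\kappa_2 &= \mu_2 = V^2k[3(1+2\rho) + 3(1 + 4\rho + 7\rho^2)k + (1 + 6\rho^2 + 8\rho^3)k^2],\\
\kappa_3 &= \mu_3 = 6V^3k^2[3(1+2\rho)^2 + (4 + 27\rho + 60\rho^2 + 53\rho^3)k\\
 &\qquad + 3\rho(4 + 9\rho + 18\rho^2 + 14\rho^3)k^2],\\
\mu_4 &= 3V^4k^2[9(1+2\rho)^2 + 2(37 + 222\rho + 471\rho^2 + 350\rho^3)k + 3(39 + 316\rho\\
 &\qquad + 1038\rho^2 + 1584\rho^3 + 1001\rho^4)k^2 + 18(3 + 24\rho + 127\rho^2 + 268\rho^3\\
 &\qquad + 346\rho^4 + 192\rho^5)k^3 + 9(1 + 24\rho^2 + 64\rho^3 + 104\rho^4 + 128\rho^5 + 64\rho^6)k^4],\\
\kappa_4 &= 6V^4k^3[28(1+2\rho)^3 + 3(17 + 144\rho + 468\rho^2 + 688\rho^3 + 411\rho^4)k\\
 &\qquad + 12(2 + 17\rho + 92\rho^2 + 193\rho^3 + 241\rho^4 + 130\rho^5)k^2\\
 &\qquad + 2(2 + 51\rho^2 + 140\rho^3 + 225\rho^4 + 264\rho^5 + 128\rho^6)k^3].
\end{aligned}
\tag{12}
\]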


These expressions reduce to equations (1) if $\rho = 1$, and (7) if $\rho = 0$. When $k$ 
is small, 

\[
c^2 = 3(1+2\rho)k, \quad \gamma_1 = 2c\,[1 + O(c^2)], \quad \gamma_2 = \frac{56c^2}{9}\,[1 + O(c^2)].
\]

Thus the relation of $\gamma_1$ and $\gamma_2$ to $c$ is almost independent of the coefficient of 
correlation. 


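We next consider the case when $z = y$, so that $\kappa = 1$, whilst $\lambda = \mu = \rho$. This 
is most simply solved by finding the moments of $(1+k^{1/2}x)(1+k^{1/2}y)^2$. Thus if 
$X = m_1(1+k^{1/2}x)$, $Y = m_2(1+k^{1/2}y)$, and $V = m_1m_2^2$, the moments of $XY^2$ about zero are 

\[
\begin{aligned}
\mu'_1 &= V[1 + (1+2\rho)k],\\
\mu'_2 &= V^2[1 + (7+8\rho)k + 3(3 + 8\rho + 4\rho^2)k^2 + 3(1+4\rho^2)k^3],\\
\mu'_3 &= V^3[1 + 18(1+\rho)k + 18(5 + 11\rho + 5\rho^2)k^2 + 30(5 + 15\rho + 18\rho^2 + 4\rho^3)k^3\\
 &\qquad + 45(1 + 6\rho + 6\rho^2 + 8\rho^3)k^4],\\
\mu'_4 &= V^4[1 + 2(17+16\rho)k + 3(127 + 256\rho + 112\rho^2)k^2 + 84(21 + 64\rho + 64\rho^2\\
 &\qquad + 16\rho^3)k^3 + 105(31 + 128\rho + 192\rho^2 + 128\rho^3 + 16\rho^4)k^4\\
 &\qquad + 630(3 + 16\rho + 32\rho^2 + 32\rho^3 + 16\rho^4)k^5 + 315(1 + 16\rho^2 + 16\rho^4)k^6].
\end{aligned}
\]

Hence 

\[
\begin{aligned}
\kappa_1 &= \mu'_1 = V[1 + (1+2\rho)k],\\
\kappa_2 &= \mu_2 = V^2k[5 + 4\rho + 4(2 + 5\rho + 2\rho^2)k + 3(1+4\rho^2)k^2],\\
\kappa_3 &= \mu_3 = 2V^3k^2[3(8 + 14\rho + 5\rho^2) + 2(29 + 84\rho + 87\rho^2 + 16\rho^3)k\\
 &\qquad + 9(2 + 14\rho + 13\rho^2 + 16\rho^3)k^2],\\
\mu_4 &= 3V^4k^2[(5+4\rho)^2 + 8(40 + 113\rho + 95\rho^2 + 22\rho^3)k + 2(427\\
 &\qquad + 1628\rho + 2376\rho^2 + 1376\rho^3 + 160\rho^4)k^2\\
 &\qquad + 24(24 + 121\rho + 237\rho^2 + 234\rho^3 + 104\rho^4)k^3 + 105(1 + 16\rho^2 + 16\rho^4)k^4],\\
\kappa_4 &= 24V^4k^3[30 + 80\rho + 65\rho^2 + 14\rho^3 + (95 + 364\rho + 513\rho^2 + 292\rho^3\\
 &\qquad + 32\rho^4)k + 3(22 + 116\rho + 227\rho^2 + 214\rho^3 + 96\rho^4)k^2\\
 &\qquad + 3(4 + 67\rho^2 + 64\rho^4)k^3].
\end{aligned}
\tag{13}
\]

Hence, approximately, 

\[
\begin{aligned}
c^2 &= (5+4\rho)k,\\
\gamma_1 &= \frac{6(2+\rho)(4+5\rho)\,c}{(5+4\rho)^2},\\
\gamma_2 &= \frac{24(30 + 80\rho + 65\rho^2 + 14\rho^3)\,c^2}{(5+4\rho)^3}.
\end{aligned}
\]

Hence $\gamma_1$ varies between $\tfrac{48}{25}c$, or $1.92c$ when $\rho = 0$, and $2c$ when $\rho = 1$. This is 
its maximum, so for high values of $\rho$, $\gamma_1/c$ is nearly constant. It vanishes when 
$\rho = -0.8$. $\gamma_2$ increases from $\tfrac{144}{25}c^2$, or $5.76c^2$, when $\rho = 0$, to $\tfrac{56}{9}c^2$, or $6.2c^2$ when 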













$\rho = 1$. This again is its maximum, so $\gamma_2/c^2$ is nearly constant for large $\rho$. $\gamma_2$ does 
not vanish for any admissible values of $\rho$. In fact for positive values of $\rho$ it would 
be impossible except with enormous samples, to distinguish this distribution 
from that of the cube of a normal variate. If $\rho = 1$, equations (13) reduce to 
equations (1). If $\rho = 0$, they give the cumulants of $XY^2$, where $X$ and $Y$ are 
uncorrelated normal variates with the same coefficient of variation, namely, 

\[
\begin{aligned}
\kappa_1 &= V(1+k), & \kappa_2 &= V^2k(5 + 8k + 3k^2),\\
\kappa_3 &= 4V^3k^2(12 + 29k + 9k^2), & \kappa_4 &= 24V^4k^3(30 + 95k + 66k^2 + 12k^3),
\end{aligned}
\]

which may easily be obtained independently. 
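
The dependence of the two shape ratios on the correlation in the $XY^2$ case can be tabulated directly from the approximate expressions for $\gamma_1$ and $\gamma_2$ in terms of $c$. This is a minimal sketch with a hypothetical function name; it evaluates only the leading terms.

```python
def shape_ratios(rho):
    """Leading-order gamma_1 / c and gamma_2 / c^2 for X * Y^2,
    where x and y have correlation rho and equal coefficients of variation."""
    g1_over_c = 6 * (2 + rho) * (4 + 5 * rho) / (5 + 4 * rho) ** 2
    g2_over_c2 = (24 * (30 + 80 * rho + 65 * rho ** 2 + 14 * rho ** 3)
                  / (5 + 4 * rho) ** 3)
    return g1_over_c, g2_over_c2

for rho in (0.0, 0.25, 0.5, 0.75, 1.0):
    print(rho, shape_ratios(rho))
```

Over the whole range of positive correlations both ratios stay within a few per cent of the values $2$ and $56/9$ found for the cube of a single variate.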



8. THE PRODUCT OF n CORRELATED NORMAL VARIATES 

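If the variates are $X_1, X_2, \dots, X_r, \dots, X_n$, where $X_r = m_r(1+a_rx_r)$, and 
$P = \Pi m_r$, while $x_r$ is a reduced normal variate, and $\rho_{rs}$ is the coefficient of 
correlation of $x_r$ and $x_s$, and hence of $X_r$ and $X_s$, the general expression for the moments 
of $X_1X_2\dots X_n$ is complicated. It can however easily be seen that the mean is: 

\[
P[1 + \Sigma\rho_{rs}a_ra_s + \Sigma(\rho_{rs}\rho_{tu} + \rho_{rt}\rho_{su} + \rho_{ru}\rho_{st})a_ra_sa_ta_u + \dots],
\]

while $\mu_2 = P^2[\Sigma a_r^2 + 2\Sigma\rho_{rs}a_ra_s + \dots]$. 

If every $a_r = k^{1/2}$, and every $\rho_{rs} = \rho$, then $\overline{x_1^{\alpha_1}x_2^{\alpha_2}x_3^{\alpha_3}\dots}$ vanishes when $m = \Sigma\alpha_r$ 
is odd, and when $m$ is even it is the coefficient of $t_1^{\alpha_1}t_2^{\alpha_2}\dots$ in the expansion of 
$(\Sigma t_r^2 + 2\rho\Sigma t_rt_s)^{\frac12 m}$, multiplied by $\alpha_1!\,\alpha_2!\,\alpha_3!\dots/(2^{\frac12 m}\,(\tfrac12 m)!)$. Thus the moments about 
zero are: 

\[
\begin{aligned}
\mu'_1 &= P\left[1 + \tfrac12 n(n-1)\rho k + \tfrac18 n(n-1)(n-2)(n-3)\rho^2k^2 + \dots + \frac{n!\,\rho^rk^r}{2^r\, r!\,(n-2r)!} + \dots\right],\\
\mu'_2 &= P^2\left[1 + n\{1 + 2(n-1)\rho\}k + \tfrac12 n(n-1)\{1 + 4(n-2)\rho + 2(2n^2 - 6n + 5)\rho^2\}k^2 + \dots\right],\\
\mu'_3 &= P^3\left[1 + 3n\{1 + \tfrac32 (n-1)\rho\}k + \tfrac92 n(n-1)\{1 + (3n-4)\rho + \tfrac14 (9n^2 - 21n + 14)\rho^2\}k^2 + \dots\right].
\end{aligned}
\]

So 

\[
\begin{aligned}
\mu_2 &= P^2nk\left[1 + (n-1)\rho + \tfrac12 (n-1)\{1 + 4(n-2)\rho + (3n^2 - 9n + 7)\rho^2\}k + \dots\right],\\
\mu_3 &= 3P^3n(n-1)[1 + (n-1)\rho]^2k^2\,[1 + O(k)].
\end{aligned}
\tag{14}
\]

Thus 

\[
\begin{aligned}
c &= [n\{1 + (n-1)\rho\}k]^{1/2}\,[1 + O(k)],\\
\gamma_1 &= \frac{3(n-1)}{n}\,c\,[1 + O(c^2)],
\end{aligned}
\tag{15}
\]

and presumably $\gamma_2$ etc. are approximately the same functions of $c$ as in the case 
of equations (9). 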







9. THE GALTON-MACALISTER DISTRIBUTION

Consider the distribution of the nth power of a normal variate when $n$ is very
large, and $k$ very small, but $nk^{1/2}$ remains constant. All but a few terms of
equations (3) will vanish, and if $nk^2$ is small, the distribution becomes identical
with the distribution described by MacAlister (1879), whose first four moments have
been given by Pearson (1905). Its cumulants can readily be found as follows:
Given that $x = \log X$ is normally distributed, to find the distribution of $X$. Let
$m$ and $s$ be the mean and standard deviation of $x$. Then $X = e^x$. Hence, since the
moment-generating function of $x$ is $M(t) = e^{mt + \frac12 s^2t^2}$, the rth moment of $X$
about zero is

$$\overline{X^r} = \overline{e^{rx}} = 1 + r\mu_1' + \frac{r^2}{2!}\mu_2' + \ldots = M(r) = e^{mr + \frac12 s^2r^2},$$

where $\mu_1'$, $\mu_2'$, etc. are the moments of $x$ about zero. Let $M = e^{m + \frac12 s^2}$, $l = e^{s^2}$. Then
$\overline{X^r} = M^r\,l^{\frac12 r(r-1)}$. Hence the cumulants of $X$ are

$$\begin{aligned}
\kappa_1 &= M,\\
\kappa_2 &= M^2(l-1),\\
\kappa_3 &= M^3(l-1)^2(l+2),\\
\kappa_4 &= M^4(l-1)^3(l^3 + 3l^2 + 6l + 6),\\
\kappa_5 &= M^5(l-1)^4(l^6 + 4l^5 + 10l^4 + 20l^3 + 30l^2 + 36l + 24),\\
\kappa_6 &= M^6(l-1)^5(l^{10} + 5l^9 + 15l^8 + 35l^7 + 70l^6 + 120l^5 + 180l^4\\
&\qquad + 240l^3 + 270l^2 + 240l + 120),
\end{aligned} \tag{16}$$

or, if $l - 1$ is a small quantity, $q$, approximating to $s^2$,

$$\begin{aligned}
\kappa_1 &= M,\\
\kappa_2 &= M^2q,\\
\kappa_3 &= M^3q^2(3 + q),\\
\kappa_4 &= M^4q^3(16 + 15q + 6q^2 + q^3),\\
\kappa_5 &= M^5q^4(125 + 222q + 205q^2 + 120q^3 + 45q^4 + 10q^5 + q^6),\\
\kappa_6 &= M^6q^5(1296 + 3660q + 5700q^2 + 6165q^3 + 4945q^4 + 2997q^5\\
&\qquad + 1365q^6 + 455q^7 + 105q^8 + 15q^9 + q^{10}).
\end{aligned}$$

Hence $c^2 = q$,

$$\begin{aligned}
\gamma_1 &= 3c + c^3,\\
\gamma_2 &= 16c^2 + 15c^4 + \ldots,\\
\gamma_3 &= 125c^3 + 222c^5 + \ldots,\\
\gamma_4 &= 1296c^4 + 3660c^6 + \ldots.
\end{aligned} \tag{17}$$

The first terms represent the limiting values of equations (3) and (9). The
leading term of $\gamma_r$, up to $\gamma_4$ at least, is $(r+2)^rc^r$. Thus for a given coefficient of
variation, $\gamma_1$ is 50% greater than in the case of the cubed normal variate, $\gamma_2$ is
157% greater.
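The relations $c^2 = q$, $\gamma_1 = 3c + c^3$ and $\gamma_2 = 16c^2 + 15c^4 + \ldots$ can be checked numerically from the raw moments $M^r l^{r(r-1)/2}$. The following Python sketch is an illustration added here, not part of the paper. It assumes $x = \log X$ is normal with mean `m` and standard deviation `s`, and the function names are hypothetical.

```python
import math

def lognormal_cumulants(m, s):
    """First four cumulants of X = exp(x), x ~ N(m, s^2), from the raw moments of X."""
    M = math.exp(m + 0.5 * s * s)
    l = math.exp(s * s)
    mu = [M ** r * l ** (r * (r - 1) / 2) for r in range(5)]   # mu[r] = E[X^r]
    k1 = mu[1]
    k2 = mu[2] - mu[1] ** 2
    k3 = mu[3] - 3 * mu[1] * mu[2] + 2 * mu[1] ** 3
    k4 = (mu[4] - 4 * mu[1] * mu[3] - 3 * mu[2] ** 2
          + 12 * mu[1] ** 2 * mu[2] - 6 * mu[1] ** 4)
    return k1, k2, k3, k4

def shape_measures(m, s):
    """Coefficient of variation c, and gamma_1, gamma_2, of the log-normal variate."""
    k1, k2, k3, k4 = lognormal_cumulants(m, s)
    c = math.sqrt(k2) / k1
    return c, k3 / k2 ** 1.5, k4 / k2 ** 2

if __name__ == "__main__":
    c, g1, g2 = shape_measures(0.0, 0.3)
    q = c * c
    print(g1 - (3 * c + c ** 3))                                # close to zero
    print(g2 - q * (16 + 15 * q + 6 * q ** 2 + q ** 3))         # close to zero
```

The value of `m` affects only the scale $M$, so `c`, `g1` and `g2` depend on `s` alone.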








240 Powers and products of normal variates 


10. BIOLOGICAL APPLICATIONS 


Wilson and Hilferty (1931) showed that the cube root of $\chi^2$ is almost normally
distributed when n exceeds 2. Haldane (1938) gave the value of the cumulants 


in this case, and showed that, for large values of n, (13x? — n)' is even more nearly 
normally distributed. It is interesting to compare the cumulants of the
cubed-normal distribution [equations (2)] with those of the $\chi^2$ distribution. The
latter are $\kappa_1 = n$, $\kappa_2 = 2n$, $\kappa_3 = 8n$, $\kappa_4 = 48n$, etc., so that $c = \sqrt{2/n}$,
$\gamma_1 = 2c$, $\gamma_2 = 6c^2$. Thus if we compare the $\chi^2$ distribution with a cubed-normal
distribution of the same coefficient of variation, we find that they are equally
asymmetrical, but that the $\chi^2$ distribution has a $\gamma_2$ which is 27/28ths that of the
cubed normal. Hence the same transformation will nearly abolish both $\kappa_3$ and
$\kappa_4$ for both distributions.

The success of Wilson and Hilferty’s transformation suggests strongly that 
we may use equations (1), (5) or (3) for the approximate normalization of moder- 
ately skew variates. This is an urgent problem in several applications of statistics 
to biology (Haldane, 1939). We evaluate a number whose mean and standard 
deviation in the case of random sampling are known. We desire to know whether 
it differs significantly from the mean. But the sampling distribution is found to 
be skew. If we can approximately normalize it, our tests of significance become 
far sharper. This problem is taken up in detail elsewhere. If $\gamma_1^2 > \tfrac{9}{16}\gamma_2$, and both
are small, so that $k^2$ can be neglected, we can find $m$, $n$ and $k$ so that $(m + k^{1/2}x)^n$
has a given $\gamma_1$ and $\gamma_2$. By equations (3) and (4),

47% vi Ke 
n=1+ 16y3— 97.’ c= o(n— 1 2 m*" = - 
If, however, y, is not small, it is necessary to take several terms of equations (3). 

If in an observed distribution of weights or volumes, the estimate of $\gamma_1/c$,
or of $\kappa_1\kappa_3/\kappa_2^2$, is approximately 2, it will be reasonable to try whether $\gamma_2/c^2$, or
$\kappa_1^2\kappa_4/\kappa_2^3$, approximates to $6\tfrac{2}{9}$, and if so to try to fit a normal distribution to
the cube roots of the variate. It will be seen that this does not imply that all the
objects considered are of the same shape. On the contrary, such a distribution 
is to be expected whenever three mutually perpendicular measurements are 
normally distributed, provided that their coefficients of variation and correlation 
are not very different. And even the greatest differences in the latter, provided 
they are not negative, will have little effect. 

Unfortunately, reliable estimates of $\gamma_1$ and $\gamma_2$ can only be obtained from
samples of the order of 1000 or more. Rendel (in a paper to be published shortly)
obtained the following estimates for the cumulants of the weight distribution of
1202 viable duck eggs, corrected for grouping. The unit is a gram:

$k_1 = 73.294$, $k_2 = 42.269$, $k_3 = +118.658$, $k_4 = +937.21$.

Hence the estimates of $c$, $\gamma_1$ and $\gamma_2$ are

$c = 0.0887 \pm 0.0018$, $g_1 = +0.432 \pm 0.071$, $g_2 = +0.524 \pm 0.142$.

The standard errors are those appropriate to a normal distribution, but the true
values cannot be very different. It is clear that $g_1$ and $g_2$ are significantly greater
than the values of 0.27 and 0.13 which we should expect were the logarithms
of weights normally distributed. They differ still more from what would be
expected were the cube roots of weights normally distributed, or on any other
hypothesis leading to a similar distribution.
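The arithmetic behind these estimates can be retraced in a few lines. The following Python sketch is illustrative only. It takes $k_2$ as 42.269, the reading that reproduces the printed $c = 0.0887$, and it assumes the usual large-sample standard errors for a normal parent: $c/\sqrt{2N}$, $\sqrt{6/N}$ and $\sqrt{24/N}$. The log-normal expectations use the leading terms $3c$ and $16c^2$ of equations (17). The function name is hypothetical.

```python
import math

def shape_estimates(k1, k2, k3, k4, n):
    """Estimates of c, g1, g2 from k-statistics, with approximate normal-theory standard errors."""
    c = math.sqrt(k2) / k1
    g1 = k3 / k2 ** 1.5
    g2 = k4 / k2 ** 2
    # Large-sample standard errors under normality (an assumption of this sketch).
    standard_errors = (c / math.sqrt(2 * n), math.sqrt(6 / n), math.sqrt(24 / n))
    return (c, g1, g2), standard_errors

if __name__ == "__main__":
    (c, g1, g2), (se_c, se_g1, se_g2) = shape_estimates(
        73.294, 42.269, 118.658, 937.21, 1202)
    print(f"c  = {c:.4f} +/- {se_c:.4f}")
    print(f"g1 = {g1:.3f} +/- {se_g1:.3f}")
    print(f"g2 = {g2:.3f} +/- {se_g2:.3f}")
    # Values expected if log(weight) were normal: leading terms of equations (17).
    print(f"log-normal expectation: g1 = {3 * c:.2f}, g2 = {16 * c * c:.2f}")
```

The results agree with the printed figures to within about one unit in the last place. The small differences presumably reflect rounding and the exact small-sample formulae.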

Pearl (1905) lists the moments of the distributions of brain weights of eight
European populations of 197 to 529 individuals. The various estimates of $c$ are
all close to 0.080, ranging from 0.074 to 0.083. Those of $\gamma_1$ range from +0.11 to
+0.40, those of $\gamma_2$ from -0.30 to +1.5. The weighted means are

$c = 0.07965 \pm 0.00106$, $g_1 = +0.2306 \pm 0.0461$, $g_2 = +0.2661 \pm 0.0922$.

If the cube roots are normally distributed, we should expect $g_1 = +0.16$,
$g_2 = +0.037$; if their logarithms are normally distributed, we should expect
$g_1 = +0.24$, $g_2 = +0.10$. The latter distribution gives the better fit, but the first
is not impossible.

Sinnott (1937) gives a graph of the distribution of the weights of squash fruits 
in an $F_2$, which is positively skew. He shows that a graph of the distribution of
their logarithms, though negatively skew, is more symmetrical. There is a sug- 
gestion that a graph of the distribution of their cube roots would be even more so. 
Unfortunately the actual figures are not given, and since curve-fitting by eye is 
notoriously uncertain, no more can be said. It is much to be desired that, when 


the full data are not given, estimates of the first four moments or cumulants 
should be published. 


11. SuMMARY 


The first four moments of a number of powers and products of normal variates
are calculated, with special reference to the probable distributions of weights,
volumes, or areas of organs and organisms. In each case the first two measures
($\gamma_1$ and $\gamma_2$ or $\beta_1$ and $\beta_2$) of deviation from normality are obtained in terms of the
coefficients of variation. The expressions obtained are almost independent of the
correlation between the linear measurements, provided the coefficients of
variation of the latter are approximately equal. The distribution found is perhaps
applicable to data on brain weights, but not to data on ducks' eggs.










REFERENCES 


Craig, C. C. (1936). On the frequency function of xy. Ann. Math. Statist. 7, 1.

Haldane, J. B. S. (1938). The approximate normalization of a class of frequency
distributions. Biometrika, 29, 392-404.

Haldane, J. B. S. (1939). New data on partial sex-linkage in man. Proc. Int. Genet.
Congr. p. 137.

MacAlister, D. (1879). The law of the geometric mean. Proc. Roy. Soc. 29, 367.

Pearl, R. (1905). Biometrical studies on man. I. Variation and correlation in
brain-weight. Biometrika, 4, 13-104.

Pearson, K. (1905). Das Fehlergesetz und seine Verallgemeinerungen durch Fechner und
Pearson. A rejoinder. Biometrika, 4, 169-212.

Sinnott, E. W. (1937). The relation of gene to character in quantitative inheritance.
Proc. Nat. Acad. Sci., Wash., 23, 224-7.

Wilson, E. B. & Hilferty, M. M. (1931). The distribution of Chi-square. Proc. Nat.
Acad. Sci., Wash., 17, 684.





THE TRANSFORMATION OF DATA FROM ENTOMOLOGICAL 
FIELD EXPERIMENTS SO THAT THE ANALYSIS 
OF VARIANCE BECOMES APPLICABLE†


By GEOFFREY BEALL 
Dominion Entomological Laboratory, Chatham, Ontario, Canada 


1. INTRODUCTORY


THE present paper deals with experiments on the control of insects in the field. 
In such experimental work the problem to be investigated is whether more insects 
survive on plots which have been subjected to one treatment than on plots 
subjected to another. It will be shown in the present paper that the numbers of 
insects found per plot must vary in such a way that one cannot, strictly, subject 
the results to the analysis of variance, and it is proposed to find how the data 
may be transformed so that analysis of variance becomes applicable. Such 
transformation has been discussed by Bartlett (1936a,b) in connexion with 


entomological experiments, and by Tippett (1934) in connexion with industrial 
experiments. 


2. EXPERIMENTAL RESULTS CONSIDERED 


The data used in the following work are results from seven insecticidal 
experiments arranged by the author at Chatham, Ontario. The work was carried 
out with replicated blocks containing plots subjected to treatments of which the 
assignment was random. This procedure, normal in agronomic work, was supple- 
mented by one repetition of each treatment within a block. The assignment of 
the repetition of a treatment was independent of the first for that treatment, 
except that, of course, the same plot could not be chosen twice. This repetition 
was carried out to obtain estimates of variability within blocks. In these experi- 
ments complete counts were not made but random sampling was employed. 
Experiments on Pyrausta nubilalis Hubn., reported by Beall et al. (1939), for 
which results are shown in Tables 1 and 2, were made on one area at two different
periods, whereas experiments on Leptinotarsa decemlineata Say, for which results 
are indicated in Tables 3 and 4, were carried out on contiguous areas at the same 
time. Three similar experiments were carried out in one place on the tobacco 
hornworm, Phlegethontius quinquemaculata Haw., for which the data are shown 
in Tables 5-7. Reference is also made to the data from a uniformity trial on 
insects of Beall (1939). 


† Publication No. 2191, Division of Entomology, Science Service, Department of Agriculture,
Ottawa, Canada. 











244 Transformation of data from entomological field experiments 


Table 1. Numbers of an insect, Pyrausta nubilalis, per plot. Experiment I 












































Block 
Treat- , 
ment | | | 
1 2 3 4 5 6 7 8 | 9 | 20 
a) aes 
1 15 23 21 31 22 14 i | | ao | 3a 
1 27 20 23 33 34 27 17 | 18 20 | 26 
2 19 12 34 16 20 10 24 | 23 a i: 
2 ll 28 37 16 26 18 19 | 13 10 9 
3 16 15 22 2 | 13 21 is | 38 27 | 10 
3 19 16 18 21 19 “4 | 21 | 18 12'| 18 
4 | M4 23 10 19 17 ei 4 | 18 8 | 17 
4 34 21 9 34 19 9 | 15 16 12 12 
5 16 | 166 | #19 26 15 11 | 18 | 23 i 
5 23 | 12 | 12 10 12 17 | 13 21 | 12 | 9 
6 2 | wm | i 10 ma i| am | % 17 a a 
6 | 15 | 16 | 15 28 | 3 | 2 | 7 5 | 4 
7 | 4 | 28 35 36 | 50 | 69 62 | 63 42 | 40 
| 7 | 47 | 81 30 | 6 | 3 | 2 | 7 | 47 50 | 43 
| | | 











Table 2. Numbers of an insect, Pyrausta nubilalis, per plot. Experiment II 





























Block 
Treat- | a 
ment | | | | 
| 1 2 D8 Oe BO a Bo Bie oe 9 10 
| Ie eee ee piso 
| | | | | 
| 1 32 38 | (27 7 i3 | 4 | 2 | 2 | 2 | 30 
a 18 40 39 | 12 19 | 2 | 30 | 19 18 | 28 
| | | | | | 
| 2 | 6 23 Shh 4 a ee Si 4 Bei) ae 
|} 2] 9 14 20 3 | 15 | 14 | 45 | 19 | 19 | 10 
| | | 

3 | 10 21 2 | 10 | 13 | 2 | 33 | 48 | 28 27 

3 | 4 21 ae de ce ck ee... | 18 27 18 

a oe . 17 a.) 8) @ iwigsnhetts. 17 

4 | 24 3 | 13 | 10 | 6 | w | & | | 3 7 
| 5 | 13 2} 5 | o | 18 | 10 | 33 | 23 | 2 | 36 
aod age 22 23 8 14 16 | 2% | 22 15 34 

| | | | | 

6 13 10 | 21 | 4 10 nae ak. 13 16 

6 | 17 9 | 29 ) 5 | 18 | 15 | 19 | 16 27 23 

7 | 37 58 | 28 ll | 24 | 44 | 30 | 44 56 45 
| 7 | 44 71 | 55 | 20 | 26 | 27 | 43 | 52 39 58 

| { 














GEOFFREY BEALL 


Table 3. Numbers of an insect, Leptinotarsa decemlineata, 
per plot. Experiment III 




















i 
Block 
Treatment 

1 2 | 3 4 5 6 7 

ee Be | | | 
1 305 391 420 355 | 287 | 175 454 
1 207 364 | 639 527 | 293 | 248 397 
2 97 49 | 21 eS) bo 10 
2 93 Sl | 37 | 24 | «2 1 
3 270 | #105 | 341 469 | 82 57 221 
3 153 | 190 | 348 212 | 100 | 285 309 
4 7 | @ | | 8 | :? = 4 
. - aw 2 = |; @ 1 3 3 

} | | | 











Table 4. Numbers of an insect, Leptinotarsa decemlineata, 
per plot. Experiment IV 








| | Block 
| Treatment — 
| | 
| 1 | a 4 5 | 6 
| | 
—<————$ —-—— oS Merri ieee te 
} 1 253 | 145 | 300 | 665 99 93 | 
1 239 | (265 166 | 230 302 | «237 «| 
2 | wt wtlsfs) om 14 | 5 
. ) St. oe WR a 159 | 108 14 13 | 
3 18 130 165 137 153 78 
3 40 137 118 142 239 63 | 
| 4 2 0 22 6 129 3 
4 2 l 31 8 9 s 





Table 5. Numbers of an insect, Phlegethontius quinquemaculata 
per plot. Experiment V 


> 





$$ 


| | Block 











| Treatment anaes ee - -~———-—-— 
1 2 | . 7 cee | 6 

j | | 

ie ai ta s + Ee | | 
a foe aS Sr ll 
1 } 4 - | @ TS 10 15 
2 0 1 | 0 1 1 
2 2 2 Fs 2 te 1 
a 15 17 2 | 3 | 8 | 16 
3 ao) 2 16 il | 13 | 25 

I | 










Table 6. Numbers of an insect, Phlegethontius quinquemaculata, 
per plot. Experiment VI 














| Block | 
| 
Treatment |—— Ln ew ett 
| 
eg 2 3 4 5 6 | 
1 12 13 a Ee 11 4 | 
1 13 9 on oe 5 10 | 
2 13 ae. l 5 7 
2 20 | 9 | 9 | 4 3: 3 
3 . = <2 | = 38 a: 2 I 
3 Jen ee oe ek 3 2 
4 1 | 1 0 1 4 3 
4 2 2 2 l 4 5 
5 13 | , 2 ae 3 6 ll | 
5 1 | 65 4 l 9 8 | 
6 7 | 6 8 9 6 5 
6 8 | 10 2 4 4 12 








| 


Table 7. Numbers of an insect, Phlegethontius quinquemaculata, 
per plot. Experiment VII 








: : 
| | Block 
| Treatment | pes 
| 
Bea Se a 4 5 6 
} } | | 
ee a a ee | 
1 |} 10 | 20 |} 14 10 17 14 | 
1 | 7 14 12 23 0) 13 
| 2 | ay 21 16 17 19 7 
| 2 | 17 ll 14 17 2] 13 
3 0 7 3 2 3 1 
| 3 1 9 l l 0 4 
4 3 12 4 5 5 2 
4 5 6 3 5 5 4 
5 3 3 3 i 3 6 
5 5 5 6 1 2 4 
6 1 15 15 13 26 24 
6 9 22 16 10 26 13 


3. THE RELATIONSHIP BETWEEN THE STANDARD DEVIATION AND THE MEAN 
IN THE EXPERIMENTAL DATA 


If $x$ is the number of insects on one of a group of small contiguous areas, say
plots, within a larger area, say a block, let the expectation of $x$ over all these
plots be $M$ and the standard deviation be $\sigma$; then over a number of the larger
areas, when the insects are distributed in a completely random fashion, from the
Poisson distribution,

$$\sigma^2 = M. \tag{1}$$

As is discussed by 'Student' (1919) one cannot, however, anticipate that (1) will
be satisfied when organisms occur in groups, as, say, when insects come from
masses of eggs, or when there is a change in expectation from plot to plot within
a block. Generally, $\sigma^2$ will tend to be greater than $M$ and we can only say

$$\sigma^2 = f(M). \tag{2}$$

The form of $f(M)$, in (2), must be considered carefully, since it bears on the form
of the transformation which may be developed to make the standard deviation
independent of the mean.


In dealing with (2), Bartlett (1936a) started by supposing that, approximately,

$$\sigma^2 = KM, \tag{3}$$

where $K$ is a constant. Generally, in field data, however, the relationship between
$\sigma^2$ and $M$, or of their respective estimates, $s^2$ and $\bar{x}$, does not, as in Fig. 1, appear
to be linear; rather, the departure of $s^2$ from $\bar{x}$ becomes disproportionately great
as $\bar{x}$ increases. This relationship between departures and the magnitude of the
mean has been discussed by Clapham (1936) in connexion with data on the
distribution of organisms differing from insects as much as flowering plants, and
he showed that only those distributions with very low mean have the squared
standard deviation close to the mean.

Our discussion above on the shortcomings of (3) suggests the conclusion that

$$\sigma^2 - M \propto M \tag{4}$$

is generally untrue. We propose to consider the possibility that the curvilinearity
of (2) might be better met by supposing that

$$\sigma^2 - M \propto M^2. \tag{5}$$

Equation (5) leads to

$$\sigma^2 = M + kM^2, \tag{6}$$

where $k$ is a constant. It will be noticed that

$$k = (\sigma^2 - M)M^{-2} \tag{7}$$

is the Charlier coefficient of disturbance from a Poisson distribution. This
coefficient was employed by Beall (1935).

It is possible to consider the suitability of (3), as compared with (6), by finding
how, respectively, they fit observations on $s^2$ and $\bar{x}$. To fit exactly is difficult, and
it was found necessary to fall back on an empirical determination of $K$ and of $k$;
thus, if there are a number of pairs of estimates, $\bar{x}$ and $s^2$, from (3) and (6) we
estimate

$$K = \sum s^2 \Big/ \sum\bar{x}, \tag{8}$$

$$k = \left(\sum s^2 - \sum\bar{x}\right) \Big/ \sum\bar{x}^2, \tag{9}$$

where $\sum$ represents the summation over all pairs.
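Equations (8) and (9) amount to two ratios of sums. The following Python sketch shows the computation. It is an illustration, not the author's own working, and the function name and sample figures are hypothetical.

```python
def fit_variance_laws(means, variances):
    """Empirical constants for the two variance laws.

    K comes from equation (8), for the law  sigma^2 = K M.
    k comes from equation (9), for the law  sigma^2 = M + k M^2.
    `means` and `variances` hold one (x-bar, s^2) pair per area.
    """
    sum_mean = sum(means)
    sum_var = sum(variances)
    sum_mean_sq = sum(m * m for m in means)
    K = sum_var / sum_mean
    k = (sum_var - sum_mean) / sum_mean_sq
    return K, k

if __name__ == "__main__":
    # Hypothetical figures lying exactly on sigma^2 = M + 0.25 M^2.
    means = [1.0, 2.0, 4.0]
    variances = [1.25, 3.0, 8.0]
    print(fit_variance_laws(means, variances))
```

With figures that lie exactly on the curve $\sigma^2 = M + 0.25M^2$, equation (9) recovers $k = 0.25$, while the single constant $K$ can only average over the curvature.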













Since in the work presented in § 2, $\bar{x}$ and $s^2$, being based on only two
observations, are highly variable, these experiments do not show clearly the
suitability of (3) and (6). Accordingly, reference is made instead to the data from
the uniformity trial on Leptinotarsa decemlineata Say of Beall (1939). When the mean
and standard deviation of 144 sampling units within each of 16 areas were
considered, the estimates from (8) and (9) were $K = 2.405$ and $k = 0.2548$. For
these values from (1), (3) and (6), curves, described as lines 1, 2 and 3 respectively,
are plotted in Fig. 1; the observed values of mean and squared standard deviation
are also shown. In the cases where the mean is near unity the departure of the
squared standard deviation from the mean, i.e. from line 1, appears to be trivial,
but as the mean increases the departure becomes more marked. It can be seen
that the observations lie more snugly about line 3 from (6) than about line 2
from (3). Generally, for the data from field studies the same effect has been
observed. Such results suggest that (6) may be generally a better approximation
to the form of $f(M)$ than (3) and make it preferable to proceed with the analysis
of data from the assumption (6).

Fig. 1. The squared standard deviation plotted against the mean for 144 small areas within each
of 16 large areas; line 1 is from equation (1), line 2 from (3) and line 3 from (6). The counts had
been made on Leptinotarsa decemlineata Say.


4. THE TRANSFORMATIONS OF FIELD DATA 


Fig. 1 shows clearly how, within an area, the variability of the numbers of
insects on sub-areas is related to the mean number of insects per sub-area. This
relationship will make invalid the use of the analysis of variance on experimental
results involving counts on insects, since the expectation of the variance
should be the same for all plots. To overcome this invalidity, Bartlett (1936a)
suggested transforming the observations, $x$, from the basis of (3). The
transformation found was $x^{1/2}$, which Bartlett modified to $(x + \tfrac12)^{1/2}$. From § 3 it was
seen, however, that for field data the relationship between standard deviation and
mean may be represented better by equation (6) than by (3), and, since the form
of the transformation depends on the form of $f(M)$, a fresh transformation must
be sought. A transformation, as is developed in the Appendix to the present paper,
is suggested by the method of Tippett (1934), i.e.

$$x' = k^{-1/2}\sinh^{-1}(kx)^{1/2}. \tag{10}$$


An advance note of this transformation was published by Beall (1940). The
adequacy of this transformation must be judged from the extent to which it
stabilizes variability. In (10), if we express $\sinh^{-1}(kx)^{1/2}$, when $kx < 1$, as a
well-known series, we have

$$x' = x^{1/2} - \tfrac{1}{6}kx^{3/2} + \tfrac{3}{40}k^2x^{5/2} - \tfrac{5}{112}k^3x^{7/2} + \ldots, \tag{11}$$

where it is obvious that for $k = 0$, $x' = x^{1/2}$. Of course, for large values of $kx$,
$x'$ varies almost as $\log x$, or as the $\log(x + 1)$ used by Williams (1937), and so our
proposed expansion may be regarded, for practical purposes, as embracing the
root and logarithmic transformations.

Table 8 gives the transformation, (10), for a probable range of observations, 
x, and for k at intervals which will probably be close enough for practical purposes. 
This table was computed in part by inverse interpolation from the table of hyper- 
bolic functions of the Smithsonian Mathematical Tables (Becker & Van Orstrand, 
1931), and in part from (12). Should values of x’ be required outside those of 
Table 8, these can conveniently be calculated from 


$$x' = k^{-1/2}\log_e\{(kx)^{1/2} + (1 + kx)^{1/2}\}. \tag{12}$$
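Where no table is to hand, transformation (10) and its logarithmic form (12) can be computed directly. The following Python sketch is an illustration rather than the author's computing procedure, and the function names are hypothetical. It treats $k = 0$ as the limiting case $x' = x^{1/2}$.

```python
import math

def beall_transform(x, k):
    """x' = k^(-1/2) * asinh((k x)^(1/2)), with the square root as the limit k = 0."""
    if k == 0:
        return math.sqrt(x)
    return math.asinh(math.sqrt(k * x)) / math.sqrt(k)

def beall_transform_log(x, k):
    """The same quantity via the logarithmic form, valid for k > 0."""
    return math.log(math.sqrt(k * x) + math.sqrt(1 + k * x)) / math.sqrt(k)

if __name__ == "__main__":
    counts = [0, 1, 4, 25, 100, 400]          # hypothetical plot counts
    for k in (0.0, 0.08, 0.30):
        print(k, [round(beall_transform(x, k), 2) for x in counts])
```

For small $kx$ the values stay close to $\sqrt{x}$, and for large $kx$ they grow like a multiple of $\log x$, in line with the remark that the transformation embraces both the root and the logarithmic forms.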


In preparing Table 8 the question arose of whether, instead of dealing with
$k^{-1/2}\sinh^{-1}(kx)^{1/2}$, one should not use $k^{-1/2}\sinh^{-1}\{k^{1/2}(x + \tfrac12)^{1/2}\}$ in the same way as
Bartlett (1936a) dealt with the transformation, $(x + \tfrac12)^{1/2}$, instead of $x^{1/2}$. This
modification was rejected on the basis of results of the transformation, as
discussed in § 5, since it was found that the addition of $\tfrac12$ made little difference and
did not give, consistently, an improvement.

For field data, in making the transformation (10), it is necessary to estimate
the value of $k$ empirically by (9), for which estimates $\bar{x}$, of the mean, and $s$, of the
standard deviation, must be found. The most obvious method in practice of making
these estimates seems to be to put more than one plot subjected to a given
treatment in a block and so to estimate the chance variation of results for a plot within
a block. In the present work, as is discussed in § 2, two plots were subjected to a
given treatment in each block and this is probably good practice.




































Table 8. The transformation $x' = k^{-1/2}\sinh^{-1}(kx)^{1/2}$

[Table 8 is printed sideways in the original; its entries, for values of $k$ from 0.00 to 1.00, are not legible in this copy.]


In the special case where there are two plots for the ith treatment ($i = 1, \ldots, n$)
in the jth block ($j = 1, \ldots, N$), and so two observations, $x_{ij1}$ and $x_{ij2}$, the estimate
of the mean will be written $\bar{x}_{ij}$ and of the squared standard deviation

$$s_{ij}^2 = \tfrac12(x_{ij1} - x_{ij2})^2. \tag{13}$$

Then from (9), we estimate $k$ from

$$k = \left[\tfrac12\sum_{i=1}^{n}\sum_{j=1}^{N}(x_{ij1} - x_{ij2})^2 - \tfrac12\sum_{i=1}^{n}\sum_{j=1}^{N}(x_{ij1} + x_{ij2})\right]\left[\tfrac14\sum_{i=1}^{n}\sum_{j=1}^{N}(x_{ij1} + x_{ij2})^2\right]^{-1}, \tag{14}$$

and the calculation is very light.
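The calculation in (14) needs only three running sums over the pairs of plots. The following Python sketch assumes the counts are supplied as (first plot, second plot) pairs, one pair per treatment within a block. The function name and the example counts are hypothetical.

```python
def estimate_k(pairs):
    """Estimate k by equation (14) from duplicate plots.

    `pairs` holds one (x1, x2) tuple for each treatment within each block.
    """
    pairs = list(pairs)
    sum_sq_diff = sum((a - b) ** 2 for a, b in pairs)   # sum of (x1 - x2)^2
    sum_total = sum(a + b for a, b in pairs)            # sum of (x1 + x2)
    sum_sq_total = sum((a + b) ** 2 for a, b in pairs)  # sum of (x1 + x2)^2
    return (0.5 * sum_sq_diff - 0.5 * sum_total) / (0.25 * sum_sq_total)

if __name__ == "__main__":
    # Hypothetical duplicate counts, not taken from Tables 1-7.
    pairs = [(10, 14), (3, 5), (20, 30)]
    print(round(estimate_k(pairs), 4))
```

A negative result would mean that the pairs are less variable than a Poisson distribution allows, in which case the model $\sigma^2 = M + kM^2$ with positive $k$ is not supported by the data.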


5. RESULTS SHOWING THE EFFECT OF THE TRANSFORMATION 
ON THE VARIABILITY OF DATA 


The adequacy of our proposed transformation may be judged in two ways: 
first, with respect to its effect, which we shall consider in the present section, on 
the differences between repetitions of a treatment within a block, and secondly, 
with respect to its effect, which we shall consider in § 6, on the behaviour of the 
quantities submitted to the analysis of variance. 

It is a fundamental assumption in the analysis of variance that the chance
variability for each plot shall be, when the effect of block and of treatment are
removed, normally distributed with a standard deviation common to all plots,
in which situation of course the standard deviation of the chance variability for
a given plot is independent of the expectation for that plot. In the data of the
present work, where each treatment is repeated in each block, it is possible to
examine the estimates of this standard deviation, $s_{ij}$, and of the expectation, $\bar{x}_{ij}$.
For a clear graphical illustration of the situation consider Fig. 2, as obtained from
the original data of Experiment III on Leptinotarsa decemlineata, where $s_{ij}$ is
plotted against $\bar{x}_{ij}$, and contrast this situation with that obtaining for the
corresponding quantities $s'_{ij}$ and $\bar{x}'_{ij}$, obtained after transformation ($k = 0.08$)
in Fig. 3.

In Fig. 2 the points are widely scattered as is natural from a sample of two;
nevertheless, it is apparent that for the smallest values of $\bar{x}_{ij}$ the values of $s_{ij}$
are correspondingly small and fall in a close group. In Fig. 3 the cluster of
observations in the lower left-hand corner of the previous diagram has disappeared,
and generally the scatter appears to be independent of $\bar{x}'_{ij}$, so that apparently
the transformation gave satisfactory results. The nature of the material involved
is such that it does not seem possible to examine the relationship under
consideration more exactly, nor to summarize exactly the corresponding results for
the other treatments; it can only be said that the same type of result appeared
although the magnitude of the relationship before transformation depended on
the magnitude of the differences between the effects of treatments.

Fig. 2. The standard deviation and mean as estimated from plots by pairs,
with untransformed data on Leptinotarsa decemlineata.

Fig. 3. The standard deviation and mean as estimated from plots by pairs, with the transformed
data on Leptinotarsa decemlineata Say, i.e. using $x' = k^{-1/2}\sinh^{-1}(kx)^{1/2}$ ($k = 0.08$).

The results shown in Figs. 2 and 3 suggest that the proposed transformation
has tended to make the standard deviation independent of the mean, in accordance
with the assumptions underlying the analysis of variance. In using this procedure
one actually assumes, more broadly, that a common standard deviation exists, so
that the homoscedasticity of observations before and after transformation should
be tested. Thus it is assumed that $x_{ij1}$ and $x_{ij2}$ are observations from a normal
population with a standard deviation, $\sigma$, which is independent of $i$ and $j$. Then
$(x_{ij1} - x_{ij2})^2/2\sigma^2$ is distributed as $\chi^2$ with one degree of freedom.† Accordingly,
$y_{ij} = (x_{ij1} - x_{ij2})/\sqrt{2}\,\sigma$ should be distributed normally with unit standard
deviation for all $i$ and $j$.

† In the $L_1$ test, discussed by Nayer (1936), this case of estimates of standard deviation with
one degree of freedom is troublesome since zero values tend to arise when dealing with grouped or
integral observations. When this is the case $L_1$, which is the ratio of an arithmetic to a geometric
mean of sums of squares, cannot be calculated. The present treatment may therefore have a wider
application.

In order to test the hypothesis of normality with unit standard
deviation it is only necessary to test for leptokurtosis; for the distribution must
be symmetrical since the sign of differences, and therefore of $y_{ij}$, is a matter of
chance. Since the number of items involved will almost certainly be < 100, and
since the population mean is zero, the $w_n$ criterion of Geary (1935) will provide an
appropriate test. In using this criterion we must find the ratio of the mean
deviation to the standard deviation, i.e.

$$w_n = \sum_{i=1}^{n}\sum_{j=1}^{N}|x_{ij1} - x_{ij2}| \Bigg/ \left\{nN\sum_{i=1}^{n}\sum_{j=1}^{N}(x_{ij1} - x_{ij2})^2\right\}^{1/2}. \tag{15}$$

Of course, values of $w_n$ may be calculated for transformed data by substitution
of $x'_{ij1}$, $x'_{ij2}$ for $x_{ij1}$, $x_{ij2}$.
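The criterion (15) is simple to compute from the within-block differences. The following Python sketch is illustrative only, and the function name and example figures are hypothetical. For a normal population the ratio of mean deviation to standard deviation tends to $\sqrt{2/\pi} \approx 0.798$, so values well below this point to leptokurtosis, that is, to unequal variability among the pairs.

```python
import math

def geary_w(pairs):
    """Ratio of mean deviation to standard deviation of the paired differences,
    both taken about the known mean of zero, as in equation (15)."""
    diffs = [a - b for a, b in pairs]
    sum_abs = sum(abs(d) for d in diffs)
    sum_sq = sum(d * d for d in diffs)
    # Undefined (division by zero) when every pair of plots agrees exactly.
    return sum_abs / math.sqrt(len(diffs) * sum_sq)

if __name__ == "__main__":
    # Hypothetical duplicate counts: one large difference among small ones.
    pairs = [(12, 11), (7, 8), (30, 10), (5, 5), (9, 10)]
    print(round(geary_w(pairs), 3))
```

The same function applies to transformed data if each count is transformed before the pairs are formed.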


Table 9. The $w_n$ test on the homoscedasticity of counts by plots
within a block for six field experiments

                   5% limits       Untransformed       Transformed         Value of k        Transformed
                                                       (Bartlett)                            (Beall)
Experi-   nN    Upper    Lower     w_n    Departure    w_n    Departure   Esti-     Em-      w_n    Departure
ment                                      by s.d.             by s.d.     mated    ployed           by s.d.

I         70    0.841    0.757   0.6659    -5.33     0.7525    -1.91      0.078    0.08    0.7846    -0.64
II        70    0.841    0.757   0.7885    -0.49     0.7807    -0.79      0.046    0.04    0.7499    -2.01
III       28    0.866    0.737   0.5973    -5.33     0.6431    -4.15      0.084    0.08    0.6838    -3.11
IV        24    0.872    0.732   0.5692    -5.67     0.6948    -2.67      0.285    0.30    0.7370    -1.66
V         18    0.881    0.728   0.7554    -1.16     0.8155    +0.13      0.082    0.08    0.7823    -0.58
VI        36    0.857    0.745   0.7959    -0.22     0.8166    +0.38      0.019    0.02    0.8130    +0.28


Values of $w_{nN}$ from the untransformed observations and from the transformed
observations, both following Bartlett (i.e. the transformation $(x + \tfrac12)^{1/2}$) and
following the line suggested in the present paper, are shown in Table 9 for the
field data of Tables 1-6. For the second transformation the values of $k$ as calculated
from (14) are shown as well as the nearest value of $k$ entered in Table 8. For
each experiment the value of $nN$ and also the 0.05 limits of probability, from
Geary (1935), are shown. There are also shown the departures of observed $w_{nN}$
from the expected value in terms of the standard deviation, a useful criterion
since the distribution of $w_{nN}$ is almost normal. From Table 9 it can be seen that
out of the three experiments in which $w_{nN}$ fell beyond the lower 5 % limit of
probability for the untransformed data and the data transformed as $(x + \tfrac12)^{1/2}$,
in only one experiment did $w_{nN}$ fall so with the final transformation. The results
for Experiment II, in which $w_{nN}$ is decreased by the transformation, are peculiar.
Consideration of the departures from the mean in terms of the standard deviation
indicates more clearly the improvement effected by each transformation







GEOFFREY BEALL 255 


and how the transformation suggested in the present work secures an improve- 
ment of the same, but more marked, character than that secured from the 
transformation of Bartlett. The results suggest that while homoscedasticity may 
not be attained always, it will be approached by means of the proposed trans- 
formation. 


6. THE EFFECT OF THE TRANSFORMATION ON THE ANALYSIS 
OF VARIANCE 


As was indicated at the beginning of § 5, our proposed transformation besides 
making the variability within a block for a repeated treatment the same for all 
treatments and blocks, should also provide quantities satisfying the assumptions
underlying the analysis of variance. Since it is not quite clear how, in so far
as the transformation is satisfactory in the first way, it will necessarily be satis- 
factory in the second, it will be well to consider directly the suitability of our 
transformed values for the analysis of variance. 

In the application of the analysis of variance one would deal with $x_{ij}$ rather
than with $x_{ijk}$ and suppose that

$$x_{ij} = A + B_i + C_j + D_{ij}, \qquad (16)$$

where $A$ is a contribution from the general level of population on the experimental
area, $B_i$ the contribution of the $i$th treatment and $C_j$ the contribution of
the $j$th block. The remainder term, $D_{ij}$, is called the interaction of treatments and
blocks. Of course, the present discussion on the untransformed values, $x_{ij}$,
holds for the transformed values, $x'_{ij} = \tfrac12(x'_{ij1} + x'_{ij2})$, when the appropriate
symbols, $A'$, $B'_i$, $C'_j$ and $D'_{ij}$, are used.

In material satisfying the conditions underlying the analysis of variance, for
the observations under each treatment, the calculated squared standard deviation is

$$s_i^2 = \frac{1}{N-1}\sum_{j=1}^{N}(x_{ij} - \bar{x}_{i.})^2. \qquad (17)$$

Following the argument of the analysis of variance, $x_{ij} - \bar{x}_{i.}$, of which the mean
is 0, is an estimate of $C_j + D_{ij}$, in which the two terms are independent; hence the
expectation of $s_i^2$ is

$$\sigma^2 = \sigma_C^2 + \sigma_D^2, \qquad (18)$$

where $\sigma_C$ and $\sigma_D$ are the standard deviations of the parameters, $C_j$ and $D_{ij}$,
respectively, and are independent of treatment. Accordingly, $s_i$ should be independent
of treatment and distributed as an estimate of $\sigma$, having $N - 1$ degrees
of freedom. Conversely, if $x_{ij}$ cannot be built of the independent terms of (16),
then the various values of $s_i$ will not be distributed as estimates of a single
standard deviation. The hypothesis that the values of $s_i$ in any one experiment
are estimates of one quantity may be tested.†


† From correspondence with Dr R. W. B. Jackson, the writer has learned that he had arrived
independently at the same test.










The results of the tests on the homogeneity of the values, $s_i$, within the six
experiments treated in the present paper are presented in Table 10, where the
value of the $L_1$ criterion is shown for the original data and for the transformed
values together with the appropriate 0.05 and 0.01 levels of probability (Nayer,
1936). From Table 10 it can be seen that of the values of $L_1$ obtained from the
original data, all but one are near or beyond the 0.05 level of significance, but
that after transformation all are moved in to less significant values. Accordingly,
the values of $s_i$, when calculated from the original data, appear heterogeneous but
the corresponding values obtained after the transformation appear homogeneous.
Thus it is more probable that the analysis of variance is applicable to the transformed
data than to the untransformed.
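A criterion of this kind can be sketched as follows. The sketch assumes that $L_1$ is taken as the ratio of the geometric mean to the arithmetic mean of the squared standard deviations, each on the same number of degrees of freedom, so that values near 1 indicate homogeneity and small values indicate heterogeneity, which is the direction in which Table 10 reads. The function name and the numbers passed to it are illustrative only and are not data from the experiments.

```python
import math

def l1_criterion(variances):
    """Geometric mean of the variance estimates divided by their
    arithmetic mean (assumed form; equal degrees of freedom)."""
    k = len(variances)
    geometric = math.exp(sum(math.log(v) for v in variances) / k)
    arithmetic = sum(variances) / k
    return geometric / arithmetic

# Illustrative (made-up) variance estimates for four treatments.
homogeneous = l1_criterion([4.0, 4.0, 4.0, 4.0])
heterogeneous = l1_criterion([1.0, 4.0, 16.0, 64.0])
print(homogeneous, round(heterogeneous, 3))
```

The observed value would then be compared with the tabled 5 % and 1 % limits for the given number of estimates and degrees of freedom.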


Table 10. The homogeneity, as measured by the criterion $L_1$, of the estimates
$s_i$ for various values of $i$ before and after transformation

                                            Experiment
                                 1       2       3       4       5       6
L_1 before transformation      0.867   0.833   0.325   0.657   0.344   0.680
L_1 after transformation       0.864   0.941   0.766   0.688   0.813   0.730
1 % limit                      0.757   0.757   0.604   0.542   0.514   0.583
5 % limit                      0.812   0.812   0.707   0.656   0.648   0.673


In Table 10 we have tested the homogeneity of the estimates, $s_i$, as in § 5 we
tested the homogeneity of $s_{ij}$, that is without reference to the values of the
associated means. In view of our original assumptions we are, however, interested
in the possibility that the standard deviations, as calculated, might show every
sign of being estimates of a common standard deviation and yet be dependent
on the associated means. Accordingly, we have investigated such dependence
roughly by fitting by least squares a first order regression of $s_i$ on $\bar{x}_{i.}$. From this
fitting we record the sign of the regression as follows:


Experiment | 


Before transformation +* + ** +* + +4 + 


After transformation _ + ie * it | Ee 







By a single asterisk we have indicated cases where the reduction in variability
effected by the regression passed the 5 % probability limit and by a double
asterisk where it passed the 1 % limit. Several points may be noted. (1) In two
cases (Experiments 2 and 6) after transformation the residual sum of squares 
about the regression was greater than the reduction in squares due to the regres- 
sion, whereas it was consistently less before transformation. (2) As can be seen 
above, the regression generally did not effect a significant reduction in variability 
after transformation but did before (the small number of degrees of freedom made 
high significance difficult of attainment). (3) After transformation the sign of 
the regression seemed to be a chance matter, whereas before transformation it 
was consistently positive. These results suggest that the transformation pro- 
posed did tend to make the variability within a given treatment independent of 
the mean for that treatment. 


7. THE EFFECT OF TRANSFORMATION UPON THE CONCLUSIONS 
FROM THE ANALYSIS OF VARIANCE 


It has been shown in §§ 5 and 6 that the analysis of variance can be made on 
entomological data when a suitable transformation has been effected. It is of 
practical interest to see what numerical effect such transformation will have upon 
tests on the significance of, say, the effect of treatment and the significance of
differences for treatments. 

First, consider the numerical results to be obtained from the analysis of 
variance (1) without and (2) with transformation. Thus the mean square ascrib- 
able to blocks, treatments and their interaction is shown in Table 11, for six 
experiments of which the data are given in § 2; parallel results are presented for
untransformed observations and for observations transformed by (10) with the 
values of k from Table 9. To facilitate the comparison of the results, the mean 
square for blocks and for treatments is expressed in terms of the estimate for 
interaction, as the F of Snedecor (1934), and presented in each case. The trans- 
formation of the data has modified the conclusions to be drawn from the analysis 
of variance in Table 11, in that there are considerable changes in the criterion, F, 
for treatments or for blocks. In the examples shown the effect of treatments was 
highly significant in all cases and so the changes introduced by transformation 
did not alter the conclusions, as would have been the case for less definite effects. 
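The comparison described here amounts to dividing each mean square by the interaction mean square. A minimal sketch, using the mean squares given for Experiment I in Table 11 (the dictionary layout and the function name are choices made for this illustration):

```python
def f_ratio(mean_square, interaction_mean_square):
    """Variance ratio F: a mean square expressed in terms of the
    interaction mean square."""
    return mean_square / interaction_mean_square

# Mean squares for Experiment I (P. nubilalis), as given in Table 11.
untransformed = {"blocks": 92.8, "treatments": 2839.0, "interaction": 76.8}
transformed = {"blocks": 0.565, "treatments": 7.51, "interaction": 0.341}

f_blocks_raw = f_ratio(untransformed["blocks"], untransformed["interaction"])
f_treat_raw = f_ratio(untransformed["treatments"], untransformed["interaction"])
f_blocks_tr = f_ratio(transformed["blocks"], transformed["interaction"])
f_treat_tr = f_ratio(transformed["treatments"], transformed["interaction"])

print(round(f_blocks_raw, 2), round(f_treat_raw, 2))
print(round(f_blocks_tr, 2), round(f_treat_tr, 2))
```

Small differences from the tabled values of F are to be expected, since the mean squares in the table are themselves rounded.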

Consider next the effect of transformation on the significance of differences
between the means for treatments as tested by the criterion, $t$, calculated with
such estimates of mean square as the interaction of Table 11. For illustration,
values of $t$, from the data on Leptinotarsa decemlineata (Experiment III), are
shown in Table 12 for each possible comparison of treatments when untransformed
data are used, when the transformation, $(x + \tfrac12)^{1/2}$, as suggested by Bartlett (1936a)
is used, and when the transformation, $k^{-1/2}\sinh^{-1}(kx)^{1/2}$, as suggested in the present
paper is used. In order that the influence of the level of population under each











Table 11. The analysis of variance of untransformed and
transformed data in six experiments

                         Degrees     Untransformed data      Transformed data
Variation                of          Mean                    Mean
                         freedom     square         F        square       F

Experiment I. P. nubilalis
Between blocks              9             92.8      1.21       0.565      1.66
Between treatments          6          2,839.0     36.95       7.51      22.03
Interaction                54             76.8       -         0.341       -

Experiment II. P. nubilalis
Between blocks              9            577.0      6.72       5.37      10.55
Between treatments          6          1,721.0     20.04       8.69      17.07
Interaction                54             85.9       -         0.509       -

Experiment III. L. decemlineata
Between blocks              6         20,172.0      2.26       4.67       3.03
Between treatments          3        390,932.0     43.77     111.5       72.16
Interaction                18          8,931.0       -         1.54        -

Experiment IV. L. decemlineata
Between blocks              5         12,960.0      2.06       2.06       1.96
Between treatments          3        124,054.0     20.40      20.40      19.41
Interaction                15          6,727.0       -         1.05        -

Experiment V. P. quinquemaculata
Between blocks              5             27.4      2.74       0.349      4.19
Between treatments          2            752.0     75.33      19.5      233.77
Interaction                10              9.98      -         0.083       -

Experiment VI. P. quinquemaculata
Between blocks              5             41.7      4.13       1.50       4.12
Between treatments          5             66.2      6.55       3.33       9.14
Interaction                25             10.1       -         0.365       -





Table 12. The values of $t$, in the comparison of means, as calculated from the
untransformed and the transformed data of Experiment III on Leptinotarsa
decemlineata

                   Means for                      Value of t
Comparison         untransformed     No trans-       (x + 1/2)^(1/2)    k^(-1/2) sinh^(-1) (kx)^(1/2)
                   data              formation

x_1. - x_2.        362     30        + 9.27**        +11.33**           + 9.92**
x_1. - x_3.        362    224        + 3.84**        + 3.52**           + 2.11*
x_1. - x_4.        362     11        + 9.82**        +12.89**           +12.46**
x_2. - x_3.         30    224        - 5.43**        - 7.81**           - 7.81**
x_2. - x_4.         30     11        + 0.54          + 1.56             + 2.54*
x_3. - x_4.        224     11        + 5.98**        + 9.37**           +10.35**







treatment may be judged, there are shown, also in Table 12, the means for the
untransformed data. The values of $t$ falling beyond the 0.01 level of significance
have been marked with two asterisks and the values beyond the 0.05 level with
one. It can be seen that the transformation resulted in a profound alteration in
the conclusions. Apparently on account of the dependence of variance on mean
in untransformed data, the pooled estimate of variance was originally too low
for the treatments which resulted in high populations and too high for the treatments
which resulted in low populations. Thus, in the comparison of the first
and third treatments, which appeared to have the two highest surviving populations,
the value of $t$ calculated from untransformed values was high. In the other
extreme case, the comparison between the second and fourth treatments, the
value of $t$, as calculated from untransformed values, was very low. It can be seen
further, that the first transformation only secured in part the modification in
the value of $t$ that was secured by the second transformation.


8. THE PROCEDURE OF TRANSFORMATION IN PRACTICE 


The methods which were found applicable in the preceding discussion will 
now be illustrated in the transformation of the data shown in Table 7 (Experiment 
VII) on Phlegethontius quinquemaculata, of the same type as the experiments 
previously discussed in the present paper. The steps in the analysis will be set out 
with the purpose of providing a model for procedure in estimating the constant, 
k, which will be used to effect a transformation of the data so that the analysis 
of variance may be made. 

Supposing that the experiment has been laid out with a repetition of each
treatment in each block, the procedure of estimating $k$ makes it first necessary to
find the sum and the absolute difference of each pair of plots subjected to a given
treatment in a given block and then to sum the sums,
$\sum_{i=1}^{n}\sum_{j=1}^{N}(x_{ij1} + x_{ij2})$, and the sums squared,
$\sum_{i=1}^{n}\sum_{j=1}^{N}(x_{ij1} + x_{ij2})^2$, and also the differences squared,
$\sum_{i=1}^{n}\sum_{j=1}^{N}(x_{ij1} - x_{ij2})^2$,
over all such pairs and by substituting the results in (14) to find $k$. In the case
being used for an illustration the two plots subjected to the first treatment in
each block gave respectively 10 and 7, 20 and 14, 14 and 12, 10 and 23, 17 and 20,
14 and 13, so that

$$\sum_{j=1}^{N}(x_{1j1} + x_{1j2}) = (10 + 7) + (20 + 14) + (14 + 12) + \dots = 174.$$

Similarly,

$$\sum_{j=1}^{N}(x_{1j1} + x_{1j2})^2 = (10 + 7)^2 + (20 + 14)^2 + \dots = 5308$$

and similarly,

$$\sum_{j=1}^{N}(x_{1j1} - x_{1j2})^2 = (10 - 7)^2 + (20 - 14)^2 + \dots = 228.$$













Of course, in estimating $k$ the summations are not limited to one treatment but
must be extended over all in the experiment. If this is done we find

$$\sum_{i=1}^{n}\sum_{j=1}^{N}(x_{ij1} + x_{ij2}) = 684, \quad \sum_{i=1}^{n}\sum_{j=1}^{N}(x_{ij1} + x_{ij2})^2 = 19{,}656, \quad \sum_{i=1}^{n}\sum_{j=1}^{N}(x_{ij1} - x_{ij2})^2 = 708.$$

From (14) we estimate

$$k = \frac{2(708 - 684)}{19{,}656} = 0.002,$$

and referring to Table 8, p. 250, use $k = 0.00$ as the nearest value occurring there.
Of course, in this case, the transformation is simply $x^{1/2}$.

Now from the above result it will be possible to replace the observed values of
Table 7 with the corresponding transformed values from the first column of
Table 8. Thus in Table 7 replace in the first row: 10, 20, 14, 10, 17 and 14, by
3.16, 4.47, 3.74, 3.16, 4.12 and 3.74. With such transformed values we can now
proceed to carry out a routine analysis of variance which will be facilitated by
working with the sum for each pair of plots in a given block with a given treatment.
For example, the final analysis of variance for Experiment VII would be carried
out with the values of Table 13.
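The arithmetic of this estimate can be sketched in Python. The form $k = 2\{\sum(\text{differences})^2 - \sum(\text{sums})\}/\sum(\text{sums})^2$ is inferred here from the substitution shown in the text, since equation (14) itself is stated on an earlier page; the function names are hypothetical.

```python
def pair_totals(pairs):
    """Sum of sums, sum of squared sums and sum of squared differences
    over pairs of plots receiving the same treatment in the same block."""
    sum_s = sum(a + b for a, b in pairs)
    sum_s2 = sum((a + b) ** 2 for a, b in pairs)
    sum_d2 = sum((a - b) ** 2 for a, b in pairs)
    return sum_s, sum_s2, sum_d2

def estimate_k(sum_s, sum_s2, sum_d2):
    """Estimate of k in the form used in the worked example (assumed
    to be the form of equation (14))."""
    return 2.0 * (sum_d2 - sum_s) / sum_s2

# First treatment of Experiment VII: the six pairs quoted in the text.
first_treatment = [(10, 7), (20, 14), (14, 12), (10, 23), (17, 20), (14, 13)]
totals_first = pair_totals(first_treatment)

# Totals over all treatments of the experiment, as quoted in the text.
k_all = estimate_k(684, 19656, 708)
print(totals_first, round(k_all, 3))
```

Only the first treatment can be recomputed from the pairs quoted; the totals over the whole experiment are taken as given.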


Table 13. Transformed and summed values to be used in the analysis of
variance for Experiment VII on P. quinquemaculata

                                    Block
Treatment       1       2       3       4       5       6

    1         5.81    8.21    7.20    7.96    8.59    7.35
    2         7.44    7.90    3.74    8.24    8.94    6.26
    3         1.00    4.06    2.73    2.41    1.73    3.00
    4         3.97    5.91    3.73    4.48    4.48    3.41
    5         3.97    3.97    4.18    2.09    3.14    4.45
    6         6.32    8.56    7.87    6.77   10.20    8.51





9. SUMMARY AND CONCLUSIONS 


The foregoing work is a study of experimental results from seven field experi- 
ments on the control of insects. In such data, the standard deviation of the 
number of insects per plot varies with the mean. By the transformation, 
$x' = k^{-1/2}\sinh^{-1}(kx)^{1/2}$, where $k$ is a constant and $x$ an observation, the data were
put in a form for which the standard deviation approached a constant independent 
of the mean. The estimation of the one constant, k, necessary for the transforma- 
tion was made possible by the design of the experiments with repetition of treat- 
ments within blocks. In practice, the transformation gave good results so that 




analysis of variance could be made. From the analysis of the transformed data, 
the results were found to differ markedly from those which would have been 
obtained from the untransformed data. 


ACKNOWLEDGEMENTS 


The present work was suggested to the writer by Prof. E.S. Pearson to whom 
the writer is further indebted for advice as the work progressed. The writer is 
further beholden to Dr R.W.B. Jackson, to Dr B.L. Welch and to Dr M.S. 
Bartlett for criticism and advice. Dr G.M. Stirrett and Mr F. A. Stinson very 
kindly permitted the use of unpublished data from their experiments carried out 
at Delhi, Ontario, on Phlegethontius quinquemaculata Haw.


APPENDIX 


As has been said, the transformation of (10) was suggested by the method used 
by Tippett (1934, p. 61). The procedure is as follows. 

It is required to find $x' = f(x)$, such that the standard deviation, $\sigma_{x'}$, of $x'$,
shall be approximately constant. Let us write

$$x' = f(M) + f'(M)(x - M) + \dots, \qquad (19)$$

where $M$ is the expectation of $x$ and whence, approximately,

$$(x' - M') = f'(M)(x - M), \qquad (20)$$

where $M'$ is the expectation of $x'$. Hence

$$\sigma_{x'}^2 = \{f'(M)\}^2\sigma^2, \qquad (21)$$

where $\sigma$ is the standard deviation of the observations, $x$. Replacing $\sigma_{x'}$ in (21)
by a constant, $c$, as is the purpose of our operation, and substituting for $\sigma$ from
equation (6), p. 247, we have

$$f'(M) = c(M + kM^2)^{-1/2}, \qquad (22)$$

where $k$ is, as has been previously discussed, a constant peculiar to our data.
Integrating in (22),

$$f(M) = 2ck^{-1/2}\sinh^{-1}(kM)^{1/2}. \qquad (23)$$

From (23) the form of the function suggested is $\sinh^{-1}(kx)^{1/2}$, but it is wise instead
to use $k^{-1/2}\sinh^{-1}(kx)^{1/2}$, since the transformation then becomes identical, as shown
in (11), with the established transformation, $x^{1/2}$, when $k = 0$.
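The transformation and its limiting form can be checked numerically. The sketch below assumes the form $x' = k^{-1/2}\sinh^{-1}(kx)^{1/2}$ with the square-root transformation taken at $k = 0$; the function name and the trial values of $M$ and $k$ are chosen only for illustration.

```python
import math

def beall_transform(x, k):
    """x' = k^(-1/2) * asinh(sqrt(k * x)); reduces to sqrt(x) at k = 0."""
    if k == 0:
        return math.sqrt(x)
    return math.asinh(math.sqrt(k * x)) / math.sqrt(k)

# The k = 0 case reproduces the square-root values used in Section 8.
root_of_ten = beall_transform(10, 0)

# For very small k the transformation approaches the square root.
near_root = beall_transform(10, 1e-8)

# Central-difference check that the slope is (1/2)(M + k M^2)^(-1/2),
# i.e. equation (22) with c = 1/2, at an arbitrary trial point.
M, k, h = 50.0, 0.08, 1e-5
numeric_slope = (beall_transform(M + h, k) - beall_transform(M - h, k)) / (2 * h)
expected_slope = 0.5 / math.sqrt(M + k * M * M)
print(round(root_of_ten, 2), numeric_slope, expected_slope)
```

The factor $\tfrac12$ in the slope corresponds to the factor 2 in (23): dropping the constant $2c$ leaves a function whose derivative is half of $(M + kM^2)^{-1/2}$.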

As Tippett (1934) says: ‘This derivation is not mathematically sound, and 
the result is only justified if on application it is found to be satisfactory.’ The 
writer would have hesitated to have used it had it not already led to useful 
transformations in cases analogous to the present, namely to $x^{1/2}$ where $x$ comes
from a Poisson distribution, to $\sin^{-1}p^{1/2}$ where $p$ comes from a binomial distribution
and, according to Tippett, to $\tanh^{-1}r$ where $r$ is the correlation coefficient.













REFERENCES 
Bartlett, M. S. (1936a). Square root transformation in analysis of variance. J. Roy.
Statist. Soc. Suppl. 3, 68-78.

Bartlett, M. S. (1936b). Some notes on insecticide tests in the laboratory and in the field. J. Roy.
Statist. Soc. Suppl. 3, 185-94.

Beall, G. (1935). Study of arthropod populations by the method of sweeping. Ecology,
16, 216-25.

Beall, G. (1939). Methods of estimating the population of insects in a field. Biometrika,
30, 422-39.

Beall, G. (1940). The transformation of data from entomological field experiments. Canadian
Ent. 72, 168.

Beall, G., Stirrett, G. M. & Conners, I. L. (1939). A field experiment on the control
of the European corn borer, Pyrausta nubilalis Hubn., by Beauveria Bassiana Vuill. II.
Sci. Agric. 19, 531-4.

Becker, G. F. & Van Orstrand, C. E. (1931). Hyperbolic functions (4th reprint). Smithsonian
Mathematical Tables.

Clapham, A. R. (1936). Over-dispersion in grassland communities and the use of statistical
methods in plant ecology. J. Ecol. 24, 232-51.

Geary, R. C. (1935). The ratio of the mean deviation to the standard deviation as a test
of normality. Biometrika, 27, 310-32.

Nayer, P. P. N. (1936). An investigation into the application of Neyman and Pearson's
$L_1$ test, with tables of percentage limits. Statist. Res. Mem. 1, 38-51.

Snedecor, G. W. (1934). Calculation and Interpretation of Analysis of Variance and
Covariance. Pp. 96. Ames, Iowa: Collegiate Press Inc.

'Student' (1919). An explanation of deviations from Poisson's law in practice. Biometrika,
12, 211-15.

Tippett, L. H. C. (1934). Statistical methods in textile research. Part 2. Uses of the
binomial and Poisson distributions. Shirley Inst. Mem. 13, 35-72.

Williams, C. B. (1937). The use of logarithms in the interpretation of certain entomological
problems. Ann. Appl. Biol. 24, 404-14.





INTERPOLATION FOR FRESH PROBABILITY LEVELS 
BETWEEN THE STANDARD TABLE LEVELS 
OF A FUNCTION 


By J. B. SIMAIKA 


1. THE PROBLEM 


A NUMBER of tables of probability functions exist, and more will no doubt before 
long be available, giving values for a variable $x$ corresponding to a limited number
of simple probability levels $\alpha$. How far is it possible to obtain $x$ rapidly for
intermediate values of $\alpha$?

The variable may be put into standardized form as the ratio of the deviation 
from the mean to the standard deviation; these two quantities (i.e. the mean and 
standard deviation) are often easy to obtain, whereas the probability integral 


may require extensive computation. Denote by $u_\alpha$ the standardized variable
so that

$$u_\alpha = \frac{x_\alpha - \text{mean } x}{\text{standard deviation of } x}. \qquad (1)$$

The question we shall consider is this: having full and accurate tables relating
$u_\alpha$ and $\alpha$ for a standardized normal variable, denoted by $U_\alpha$, can we use these
values as auxiliary in obtaining $u_\alpha$ for any other function tabled only at a few
probability levels and, having found an interpolating formula, what is its accuracy?
In examining this point we shall compare the accuracy of the method with that of
some other methods of deriving intermediate values of $u_\alpha$.


2. GENERAL APPROACH 


It will be useful to consider first how far a general theoretical approach will
take us. Let the variable $x$ follow a probability law defined only by its cumulants
$\kappa_r$ $(r = 1, 2, \dots)$; then the first four cumulants of the variable $u$ become 0, 1, $\gamma_1$, $\gamma_2$.
It is known that the relation between $u_\alpha$ and $\alpha$ may be written symbolically

$$\alpha = \int_{-\infty}^{u_\alpha} \exp\Big\{-\tfrac{1}{6}\gamma_1\frac{d^3}{dx^3} + \tfrac{1}{24}\gamma_2\frac{d^4}{dx^4} - \dots\Big\}\frac{1}{\sqrt{(2\pi)}}\,e^{-\frac12 x^2}\,dx, \qquad (2)$$

while the same relation for a normal variable is

$$\alpha = \int_{-\infty}^{U_\alpha} \frac{1}{\sqrt{(2\pi)}}\,e^{-\frac12 x^2}\,dx. \qquad (3)$$

Using equations (2) and (3) it has been shown by Cornish & Fisher (1937)
that $u_\alpha$ can be approximated to by a parabolic curve in $U_\alpha$ and vice versa. If we











264 Interpolation for probability levels 


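assume that $\kappa_r$ for $r > 5$ is negligible, this expression, using a fifth degree parabola, is

$$u_\alpha = A + BU_\alpha + CU_\alpha^2 + DU_\alpha^3 + EU_\alpha^4 + FU_\alpha^5, \qquad (4)$$

where

$$A = -\tfrac{1}{6}\gamma_1 - \tfrac{1}{12}\gamma_1\gamma_2 + \tfrac{17}{324}\gamma_1^3,$$

$$B = 1 - \tfrac{1}{8}\gamma_2 + \tfrac{5}{36}\gamma_1^2 - \tfrac{29}{384}\gamma_2^2 + \tfrac{107}{288}\gamma_1^2\gamma_2 - \tfrac{1511}{7776}\gamma_1^4,$$

$$C = \tfrac{1}{6}\gamma_1 + \tfrac{5}{24}\gamma_1\gamma_2 - \tfrac{53}{324}\gamma_1^3,$$

$$D = \tfrac{1}{24}\gamma_2 - \tfrac{1}{18}\gamma_1^2 + \tfrac{1}{16}\gamma_2^2 - \tfrac{103}{288}\gamma_1^2\gamma_2 + \tfrac{211}{972}\gamma_1^4,$$

$$E = -\tfrac{1}{24}\gamma_1\gamma_2 + \tfrac{1}{27}\gamma_1^3,$$

$$F = -\tfrac{1}{128}\gamma_2^2 + \tfrac{7}{144}\gamma_1^2\gamma_2 - \tfrac{7}{216}\gamma_1^4.$$

Now as $\gamma_1$ and $\gamma_2$ tend to zero, $B$ tends to unity and all other coefficients tend
to zero, i.e. the curve of $u_\alpha$ as a function of $U_\alpha$ tends to the diagonal line
$u_\alpha = U_\alpha$.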
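The behaviour of the polynomial (4) can be examined numerically. The sketch below takes the coefficients in the standard Cornish and Fisher form in $\gamma_1$ and $\gamma_2$, which is assumed here to be what (4) intends; the function names are hypothetical, and the trial case ($\chi^2$ with 18 degrees of freedom, for which $\gamma_1 = \sqrt{(8/18)}$, $\gamma_2 = 12/18$ and the tabled upper 5 % point is 28.869) is chosen only as a check.

```python
import math
from fractions import Fraction as Fr
from statistics import NormalDist

def cornish_fisher_coefficients(g1, g2):
    """Coefficients A..F of the fifth-degree polynomial in U
    (standard Cornish-Fisher terms in gamma_1 and gamma_2 only)."""
    A = -Fr(1, 6) * g1 - Fr(1, 12) * g1 * g2 + Fr(17, 324) * g1**3
    B = (1 - Fr(1, 8) * g2 + Fr(5, 36) * g1**2 - Fr(29, 384) * g2**2
         + Fr(107, 288) * g1**2 * g2 - Fr(1511, 7776) * g1**4)
    C = Fr(1, 6) * g1 + Fr(5, 24) * g1 * g2 - Fr(53, 324) * g1**3
    D = (Fr(1, 24) * g2 - Fr(1, 18) * g1**2 + Fr(1, 16) * g2**2
         - Fr(103, 288) * g1**2 * g2 + Fr(211, 972) * g1**4)
    E = -Fr(1, 24) * g1 * g2 + Fr(1, 27) * g1**3
    F = -Fr(1, 128) * g2**2 + Fr(7, 144) * g1**2 * g2 - Fr(7, 216) * g1**4
    return [float(c) for c in (A, B, C, D, E, F)]

def standardized_quantile(U, g1, g2):
    """Evaluate u = A + B U + ... + F U^5."""
    coeffs = cornish_fisher_coefficients(g1, g2)
    return sum(c * U**power for power, c in enumerate(coeffs))

U95 = NormalDist().inv_cdf(0.95)

# With both gammas zero the curve is the diagonal u = U.
u_normal = standardized_quantile(U95, 0.0, 0.0)

# Trial case: chi-square with nu = 18 degrees of freedom.
nu = 18
u_chi2 = standardized_quantile(U95, math.sqrt(8 / nu), 12 / nu)
chi2_approx = nu + math.sqrt(2 * nu) * u_chi2
print(round(u_normal, 4), round(chi2_approx, 2))
```

The approximation lands within a few hundredths of the tabled 28.869; the residue reflects the neglected higher cumulants.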

Figs. 1-4 give these curves for different numbers of degrees of freedom for the
commonly used statistical variables $\chi^2$, $\chi$, $t$ and $v$ (a transformation of $z$ referred
to below), expressed in standardized form.

With regard to the coefficients in (4) it may be remarked that large values of
$\gamma_2$ do not increase them as much as large values of $\gamma_1$. Furthermore, when $\gamma_1$ is
zero, the coefficients $A$, $C$ and $E$ vanish and the expression (4) becomes

$$u_\alpha = BU_\alpha + DU_\alpha^3 + FU_\alpha^5,$$

or

$$u_\alpha/U_\alpha = B + DU_\alpha^2 + F\{U_\alpha^2\}^2. \qquad (5)$$


These broad results suggest that a good method of interpolation, when both $\gamma_1$
and $\gamma_2$ exist, is a Lagrange formula through the points

$(u_{\alpha_s}, U_{\alpha_s})$ $(s = 1, 2, \dots)$.

Remembering that $u_{\alpha_s} = (x_{\alpha_s} - \text{mean } x)/(\text{S.D. of } x)$, from the practical point of
view the interpolation can be carried out more expeditiously and without loss
of accuracy by using a Lagrangian formula through

$(x_{\alpha_s}, U_{\alpha_s})$ $(s = 1, 2, \dots)$.

When, however, $\gamma_1$ is zero as in the case of the $t$-distribution it would be better to
take the Lagrange formula through the points

$(u_{\alpha_s}/U_{\alpha_s},\ U_{\alpha_s}^2)$ or alternatively through $(x_{\alpha_s}/U_{\alpha_s},\ U_{\alpha_s}^2)$ $(s = 1, 2, \dots)$,

the mean $x$ being zero.

The accuracy of the method is likely to depend on the values of $\gamma_1$ and $\gamma_2$.
For example, as seen in Fig. 1, linear interpolation between, say, Ujo; and Up.os 
will be more accurate with $\nu = 18$ than $\nu = 3$, the $\gamma$'s being smaller in the former
case. Again, the curves will be more nearly linear if we take as variable

$u = (\chi - \text{mean } \chi)/\sigma_\chi$ rather than $u = (\chi^2 - \text{mean } \chi^2)/\sigma_{\chi^2}$

because the $\gamma$'s for the former are the smaller.











J. B. SmMAIKA 265 


For the practical worker it will often be sufficient to use linear interpolation, 
i.e. to make use of two tabled probability levels only. For more accurate work 



















[Fig. 1. Relation between $u(\chi^2)$ and $U$. Horizontal axis: standardized normal variate, $U$. Curves are shown for the upper end and the lower end of the distribution.]


three or more levels can be used, but an increase beyond this is not in fact likely 
to lead to a gain in accuracy which will be worth the labour. 








266 Interpolation for probability levels 


3. COMPARISON OF METHODS


To compare the accuracy of the interpolation based on the polynomial 
expansion (4) with that obtained by other possible methods, we have considered 
the $\chi^2$, $t$ and $v$ (beta)-probability distributions. For each, different methods of
interpolation have been devised. In some a transformation of the variable has 

















[Fig. 2. Relation between $u(\chi)$ and $U$. Horizontal axis: standardized normal variate, $U$. Curves are shown for the upper end of the distribution.]


been used, while in others a transformation of the argument or a completely new 
argument has been considered. 

The accuracy of each method in the range covered by the values $\alpha < 0.10$
and $\alpha > 0.90$ has been tested in the following way: between any two consecutive
probability levels a number—not less than three—of intermediate values have 
been interpolated. These values were chosen to be those which could be obtained 
accurately from some other table. The greatest deviations in each interval are 















given in Tables 1-3. The methods have been arranged according to the degree 
of accuracy obtained. 

















[Fig. 3. Relation between $u(t)$ and $U$. Horizontal axis: standardized normal variate, $U$.]


It remains to point out that, if the $n$ tabulated probability levels are denoted by

$\alpha_1 < \alpha_2 < \dots < \alpha_n < 0.10$,
and if the interpolation is to be carried by using a quadratic or higher expression 
in the argument, it has been found that the interpolated value is always more 










accurate when the probability levels used include as many as possible of the 
probability levels below ~,. Similarly, if 

$0.90 < \alpha_1 < \alpha_2 < \dots < \alpha_n$,


it is better to use probability levels including as many as possible of those above «,. 


[Fig. 4. Relation between $u(v)$ and $U$, where $p(v) = Cv^{p-1}(1 - v)^{q-1}$. Axes: standardized $v$-variate, $u$; standardized normal variate, $U$.]
4. THE $\chi^2$-PROBABILITY FUNCTION

The standardized form of $\chi^2$ used in Fig. 1 is

$$u(\chi^2) = \frac{\chi^2 - \nu}{\sqrt{(2\nu)}}, \qquad (6)$$

$\nu$ being the number of degrees of freedom. The $\gamma_1$ and $\gamma_2$ of this distribution are



















$\sqrt{(8/\nu)}$ and $12/\nu$ respectively. If we consider $\chi$ itself instead of $\chi^2$ we find that its
standardized form has the approximate value, used in Fig. 2,





i (7) 


and its $\gamma_1$ and $\gamma_2$ are $(2\nu)^{-1/2} + O(\nu^{-3/2})$ and $O(\nu^{-2})$ respectively. Both these last
quantities are smaller than those for $\chi^2$, which suggests that interpolation will
be more accurate using $\chi$ rather than $\chi^2$. This can also be seen from Figs. 1 and 2.
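For $\chi^2$ the quantities involved are simple enough to tabulate directly. The sketch below assumes the standardization $u(\chi^2) = (\chi^2 - \nu)/\sqrt{(2\nu)}$ and the values $\gamma_1 = \sqrt{(8/\nu)}$, $\gamma_2 = 12/\nu$; the function names are hypothetical, and the degrees of freedom are those used in Table 1.

```python
import math

def chi_square_gammas(nu):
    """gamma_1 and gamma_2 of the chi-square distribution on nu d.f."""
    return math.sqrt(8.0 / nu), 12.0 / nu

def standardize_chi_square(chi2, nu):
    """u(chi^2) = (chi^2 - nu) / sqrt(2 nu)."""
    return (chi2 - nu) / math.sqrt(2.0 * nu)

gammas = {nu: chi_square_gammas(nu) for nu in (3, 5, 9, 18)}
for nu, (g1, g2) in gammas.items():
    print(nu, round(g1, 2), round(g2, 2))

# The mean itself always standardizes to zero.
u_at_mean = standardize_chi_square(18.0, 18)
```

The rounded values agree with the $\gamma_1$ and $\gamma_2$ columns of Table 1, and both shrink as $\nu$ grows, which is why the curves for $\nu = 18$ lie nearer the diagonal than those for $\nu = 3$.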

The $\chi^2$ probability levels have been tabulated by Fisher (1941) for $\alpha = 0.01$,
0.02, 0.05, 0.10, ..., 0.90, 0.95, 0.98, 0.99 and are given to three decimal places.
These levels were used in the interpolation.* The accuracy of the interpolated
values obtained by the eight methods detailed below was checked either from the
Tables of the Incomplete Gamma Function (Karl Pearson, 1922) or from Tables
for Statisticians and Biometricians, Part I, Table XII (Karl Pearson, 1930).

The greatest deviations, $\delta_m$ $(m = 1, 2, \dots, 8)$, obtained using in all eight different
methods, are given for $\nu = 3$, 5, 9 and 18 and for intervals of $\alpha$: (0.01, 0.02),
(0.02, 0.05), (0.05, 0.10), (0.90, 0.95), (0.95, 0.98) and (0.98, 0.99) in Table 1.


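Method 1. $\chi^2 = A + B\log\alpha$ $(\alpha < 0.10)$, $\chi^2 = A + B\log(1 - \alpha)$ $(\alpha > 0.90)$.
Method 2. $u(\chi) = A + B\log\alpha$ $(\alpha < 0.10)$, $u(\chi) = A + B\log(1 - \alpha)$ $(\alpha > 0.90)$.
Method 3. $u(\chi^2) = A + BU$.
Method 4. $u(\chi^2) - U = A + B\log\alpha$ $(\alpha < 0.10)$, $u(\chi^2) - U = A + B\log(1 - \alpha)$ $(\alpha > 0.90)$.
Method 5. $u(\chi) = A + BU$.
Method 6. $\chi^2 = A + B\log\alpha + C\log^2\alpha$ $(\alpha < 0.10)$,
          $\chi^2 = A + B\log(1 - \alpha) + C\log^2(1 - \alpha)$ $(\alpha > 0.90)$.
Method 7. $u(\chi^2) = A + BU + CU^2$.
Method 8. $u(\chi) = A + BU + CU^2$.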


The best linear interpolation is that provided by method 5 and the best 


quadratic one is given by method 8. Both these methods are those suggested by 
the general approach. 
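Methods 3 and 5 can be compared on one of the cases of Table 1. The sketch below interpolates for $\alpha = 0.96255$ with $\nu = 18$ between the levels 0.95 and 0.98, where the true value is $\chi^2 = 30.000$. The two tabled points, 28.869 and 32.346, are the usual three-decimal percentage points for 18 degrees of freedom and are supplied here as an assumption, as are the function and variable names. Because the standardization is linear, interpolating $u(\chi^2)$ or $u(\chi)$ linearly in $U$ is the same as interpolating $\chi^2$ or $\chi$ itself.

```python
import math
from statistics import NormalDist

def interpolate_linear_in_U(alpha, level_lo, level_hi, y_lo, y_hi):
    """Linear interpolation of a tabled quantity y against the normal
    deviate U corresponding to each probability level."""
    normal = NormalDist()
    U = normal.inv_cdf(alpha)
    U_lo = normal.inv_cdf(level_lo)
    U_hi = normal.inv_cdf(level_hi)
    return y_lo + (U - U_lo) * (y_hi - y_lo) / (U_hi - U_lo)

# Tabled chi-square points for nu = 18 at the levels 0.95 and 0.98
# (assumed values from a standard three-decimal table).
chi2_95, chi2_98 = 28.869, 32.346
alpha, true_chi2 = 0.96255, 30.000

# Method 3: chi-square itself taken as linear in U.
method3 = interpolate_linear_in_U(alpha, 0.95, 0.98, chi2_95, chi2_98)

# Method 5: chi (the square root) taken as linear in U, then squared.
chi_interp = interpolate_linear_in_U(
    alpha, 0.95, 0.98, math.sqrt(chi2_95), math.sqrt(chi2_98))
method5 = chi_interp ** 2

print(round(abs(method3 - true_chi2), 3), round(abs(method5 - true_chi2), 3))
```

The two deviations come out close to the entries 28 and 6 (in units of the third decimal place) given for this interval under methods 3 and 5 in Table 1.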


* Certain additional levels are given in a recently published table computed by Catherine M. 
Thompson (1941). 









































































Table 1. Greatest deviations in the interpolation of $\chi^2$ and $\chi$

 ν   γ_1    γ_2    Interval        α         True χ²   |δ_1|  |δ_2|  |δ_3|  |δ_4|  |δ_5|  |δ_6|  |δ_7|  |δ_8|

 3   1.63   4.00   (0.02, 0.05)   0.02 997    0.245    0.006  0.007  0.011  0.012  0.004  0.003  0.003  0.001
                   (0.05, 0.10)   0.07 890    0.490      15      8     10      1      4      4      3      0
                   (0.90, 0.95)   0.92 810    7.000       4      3     27     14      5      1      0      1
                   (0.95, 0.98)   0.97 071    9.000       5     33     36     10      6      0      0      2
                   (0.98, 0.99)   0.98 827   11.000       1     13     34      5      1      2      1      2

 5   1.26   2.40   (0.01, 0.02)   0.01 353    0.632       8      5      6      3      2      6      0      0
                   (0.02, 0.05)   0.03 340    0.949      23     13     15      3      5      5      1      0
                   (0.05, 0.10)   0.06 150    1.265      19     11     11      0      3      4      0      1
                   (0.90, 0.95)   0.92 808   10.119      11     32     26     12      5      3      1      1
                   (0.95, 0.98)   0.96 544   12.017      13     39     32     12      6      2      1      0
                   (0.98, 0.99)   0.98 579   14.230       5     18     17      6      4      2      1      1

 9   0.94   1.33   (0.01, 0.02)   0.01 060    2.121       4      2      2      2      0      1      1      0
                   (0.02, 0.05)   0.03 452    2.970      34     21     19      0      6      6      0      1
                   (0.05, 0.10)   0.07 705    3.818      33     24     17      6      5      8      1      0
                   (0.90, 0.95)   0.93 312   16.000      19     38     17     11      5      4      0      0
                   (0.95, 0.98)   0.96 483   18.000      22     46     31     12      7      4      1      1
                   (0.98, 0.99)   0.98 735   21.000       8     18     14      5      4      2      1      0

18   0.67   0.67   (0.01, 0.02)   0.01 167    7.200      14      9      5      2      1      4      1      1
                   (0.02, 0.05)   0.02 793    8.400      47     32     20      2      5      7      0      1
                   (0.05, 0.10)   0.07 482   10.200      48     34     18      5      6     11      ?      0
                   (0.90, 0.95)   0.93 159   27.600      34     52     25     10      5      7      1      1
                   (0.95, 0.98)   0.96 255   30.000      34     55     28     10      6      6      1      1
                   (0.98, 0.99)   0.98 589   33.600      15     26     13      5      4      3      1      1

For methods associated with subscripts to $\delta$, see p. 269.
Table 2. Greatest deviations in the interpolation of $t$

 ν   γ_2    Interval          α         True t   |δ_1|  |δ_2|  |δ_3|  |δ_4|  |δ_5|  |δ_6|  |δ_7|  |δ_8|

 3    ∞     (0.005, 0.01)    0.00 692   5.196    0.039  0.040  0.035  0.020  0.011  0.008  0.007  0.004
            (0.01, 0.025)    0.01 430   3.969      64     52     42     21     15      7      6      2
            (0.025, 0.05)    0.03 261   2.848      28     24     16      7      8      5      3      0
            (0.050, 0.10)    0.07 285   1.954      25     23     11      4      7      3      3      1

 6   3.00   (0.005, 0.01)    0.00 714   3.413      11      8      5      3      2      1      1      1
            (0.01, 0.025)    0.01 517   2.820      18     14      6      3      2      1      1      1
            (0.025, 0.05)    0.03 874   2.128       8      6      1      1      2      2      0      1
            (0.050, 0.10)    0.06 820   1.719       9      5      ?      ?      1      1      0      0

10   1.00   (0.005, 0.01)    0.00 625   3.038       5      4      1      2      1      1      1      1
            (0.01, 0.025)    0.01 639   2.476       8     11      1      1      1      1      1      1
            (0.025, 0.05)    0.03 844   1.972       4      3      2      ?      1      1      0      0
            (0.050, 0.10)    0.07 246   1.581       4      4      5      0      1      ?      ?      ?


















































For methods associated with subscripts to $\delta$, see p. 271.







5. THE t-PROBABILITY FUNCTION 


The standardized form of the $t$-probability function used in Fig. 3 is

$$u(t) = t\sqrt{\{(\nu - 2)/\nu\}}, \qquad (8)$$

$\nu$ being the number of degrees of freedom. The values of $\gamma_1$ and $\gamma_2$ are zero and
$6/(\nu - 4)$ respectively.

Percentage levels for $t$ have been tabulated by Fisher (1941) for $\alpha$ = 0.005,
0.01, 0.025, 0.05, 0.10, ... and are given to three decimal places. The level here
defined as $\alpha$ is half the figure given in Fisher's table, i.e.

$$\alpha = \int_{-\infty}^{t_\alpha} p(t)\,dt. \tag{9}$$


The values of $t$ used in checking those obtained by interpolation are taken
from Tables of the Incomplete Beta-Function (Karl Pearson, 1934), where

$$I_x(p,q) = 2\alpha, \qquad p = \tfrac{1}{2}\nu, \quad q = \tfrac{1}{2} \quad\text{and}\quad x = \frac{\nu}{\nu+t^2}, \tag{10}$$

$\nu$ being the number of degrees of freedom.

The greatest deviations found in the following eight methods are denoted
by $\delta_m$ ($m$ = 1, 2, ..., 8), and are given in Table 2 for $\nu$ = 3, 6 and 10 and for the
intervals (0.005, 0.01), (0.01, 0.025), (0.025, 0.05), (0.05, 0.10).

Method 1. $u(t) = A + BU$.
Method 2. $u(t) - U = A + B\log\alpha$.
Method 3. $t = A + B\log\alpha$.
Method 4. $u(t) = AU + BU^3$.
Method 5. $u(t) = A + BU + CU^2$.
Method 6. $t = A + B\log\alpha + C\log^2\alpha$.
Method 7. $u(t) - U = A + B\log\alpha + C\log^2\alpha$.
Method 8. $u(t) = AU + BU^3 + CU^5$.


Of the methods that make use of only two probability levels, method 4
gives the highest accuracy, and of those that make use of three
probability levels, method 8 is the best. These two methods are those suggested
by the general approach.


6. THE v OR BETA-DISTRIBUTION 
This variable, which is related to R. A. Fisher's $z$, is defined as

$$v = \frac{S_2}{S_1+S_2} = \frac{\nu_2}{\nu_2+\nu_1 e^{2z}}, \tag{11}$$








272 Interpolation for probability levels 


where $S_1$ is a sum of squares of normal variates based on $\nu_1$ degrees of freedom and
$S_2$ an independent sum of squares based on $\nu_2$ degrees of freedom. The elementary
probability law of $v$ is

$$p(v) = \{B(\tfrac{1}{2}\nu_1, \tfrac{1}{2}\nu_2)\}^{-1}\, v^{\frac{1}{2}\nu_2-1}(1-v)^{\frac{1}{2}\nu_1-1}. \tag{12}$$


Table 3. Greatest deviations in the interpolation of v 











YyAiYe) Vi Ye Interval x True v 14, | |, | 13] | [bs] | [os] | 1] 
| | | 
RS os be Stee F ge 3 | 
| 4/20 —0-92 | +0-79 (0-005, 0-01) | 0-00 838 | 0-52000 | 0-00043 | 0-00016 | 0-00006 | 0-00007 | 0-00006 | 0-00005 
| (0-01, 0-05) | 0-02 607 |0-59000, -296 76 34 35 7 27 
| | | (0-05, 0-10) | 0-06 901 | 0-66000 79 5 28 14 6 10 
8|30| —0-61| +0-25 | (0-005, 0-01) | 0-00 769 | 0-53000 25 4 7 16 | 9 5 
| (0-01, 0-05) | 0-02 711 |0-59000,  ~ 228 36 24 35 4) 13 
(0-05, 0-10) 0-06 6450-64000 67 4 16 16 | 2 | 5 
eee | 
15 | 30 —0-28 | —0-12 | (0-005, 0-01) | 0-00 666 | 0-41000 31 | 3 3 8 9 7 
(0-01, 0-05) 0-02 826 |0-47000, 240 40 22 42 3 9 
4 (0-05, 0-10) 0-07 463 | 0-52000 78 | 9 1| 20 1 9 
= | 
| 8/15) —0-33 | —0-26 | (0-005, 0-01) | 0-00677 | 0-30000 53 19 20 13 | 3 3 
(0-01, 0-05) | 0-02 519 037000, 418 128 114 67 | ul 22 
| (0-05, 0-10) 0-06 9720-44000 126 27 16 29 2 15 
4) 5 —0-16) —0-77 (0-005, 0-01) 0-00 796 | 0-09000 80 32 58 29 16 5 
| (0-01, 0-05) | 0-02 7230-15000, 838! 375. 534 18] 93 19 


(0-05, 0-10) | 0-07 421 | 0-23000 278 123 144 78 33 8 


For methods associated with subscripts to δ, see p. 273.


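Low values of $v$ correspond to high values of $z$. The standardized form of $v$
used in Fig. 4 is

$$u(v) = \frac{v - \dfrac{\nu_2}{\nu_1+\nu_2}}{\dfrac{1}{\nu_1+\nu_2}\left(\dfrac{2\nu_1\nu_2}{\nu_1+\nu_2+2}\right)^{1/2}} \tag{13}$$

and the values of $\gamma_1$ and $\gamma_2$ are

$$\gamma_1 = \frac{4(\nu_1-\nu_2)}{\nu_1+\nu_2+4}\left(\frac{\nu_1+\nu_2+2}{2\nu_1\nu_2}\right)^{1/2}, \qquad
\gamma_2 = \frac{12\{\nu_1^2(\nu_1+2)+\nu_2^2(\nu_2+2)-2\nu_1\nu_2(\nu_1+\nu_2+4)\}}{\nu_1\nu_2(\nu_1+\nu_2+4)(\nu_1+\nu_2+6)}. \tag{14}$$

The cases considered here are those most generally met in tests of significance
with $\nu_1 < \nu_2$ and therefore $\gamma_1 < 0$. $\gamma_2$ is sometimes positive and sometimes negative.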

Tables giving values of $v$ for $\alpha$ = 0.005, 0.01, 0.025, 0.05, 0.10, 0.25 and 0.50
to five decimal places have recently been published (Catherine M. Thompson,
1941). The corresponding upper probability levels can be obtained by entering
the tables with $\nu_1$ and $\nu_2$ transposed and taking $1-v$ for $v$.




The values of $v$ used in checking the interpolation are taken from the Tables
of the Incomplete Beta-Function (Karl Pearson, 1934).

The greatest deviations $\delta_m$ ($m$ = 1, 2, ..., 7), found in the following seven
methods are given in Table 3 for the following pairs of values of $\nu_1$ and $\nu_2$:
(4, 20), (8, 30), (15, 30), (8, 15), (4, 5), and for the intervals (0.005, 0.01),
(0.01, 0.05), (0.05, 0.10) of $\alpha$.

Method 1. $v = A + B\log\alpha$ ($\alpha < 0.10$), $v = A + B\log(1-\alpha)$ ($\alpha > 0.90$).
Method 2. $u(v) - U = A + B\log\alpha$ ($\alpha < 0.10$), $u(v) - U = A + B\log(1-\alpha)$ ($\alpha > 0.90$).
Method 3. $u(v) = A + BU$.
Method 4. $v = A + B\log\alpha + C\log^2\alpha$ ($\alpha < 0.10$),
          $v = A + B\log(1-\alpha) + C\log^2(1-\alpha)$ ($\alpha > 0.90$).
Method 5. $u(v) - U = A + B\log\alpha + C\log^2\alpha$ ($\alpha < 0.10$),
          $u(v) - U = A + B\log(1-\alpha) + C\log^2(1-\alpha)$ ($\alpha > 0.90$).
Method 6. $u(v) = A + BU + CU^2$.
Method 7. $u(v) = A + BU + CU^2 + DU^3$.

Here also the best linear method and the best quadratic one are those suggested
by the general approach, namely, methods 3 and 6. Method 7, a cubic in
$U$, was used only because the great accuracy of the tabulated $v$-function justified
the computation. This cubic interpolation gives errors of the order of 0.0001
even for numbers of degrees of freedom as small as $\nu_1 = 4$, $\nu_2 = 5$. If the interval
between $\alpha$ = 0.01 and 0.05 were broken into two parts at 0.025 (a level given in
the Thompson tables) the corresponding errors $\delta_7$ would be considerably reduced.


7. SOME NUMERICAL EXAMPLES, USING LINEAR INTERPOLATION 
(a) Interpolation for a $\chi^2$ level. Suppose that we calculate the upper 2½%
level ($\alpha$ = 0.975) for $\chi^2$ with $\nu$ = 5 degrees of freedom, using tabled values of the
upper 2% and 5% levels. We require the 2%, 2½% and 5% levels of the standardized
normal variable as well as the two $\chi^2$ levels. The relevant data are shown
in the following table; as pointed out above, there is no need to calculate the
standardized form of either $\chi$ or $\chi^2$.

 α       U_α      χ²_α     χ_α
 0.98    2.0537   13.388   3.659
 0.975   1.9600   ?        ?
 0.95    1.6449   11.070   3.327








By linear interpolation, i.e. using method 3 of p. 269, we have

$$\chi^2_{0.975} = 11.070 + (13.388 - 11.070)\times\frac{1.9600 - 1.6449}{2.0537 - 1.6449} = 12.857.$$

Interpolating similarly for $\chi$, i.e. using method 5 of p. 269, we find
$\chi_{0.975} = 3.583$ or $\chi^2_{0.975} = 12.838$.
The correct value taken from Miss Thompson's table (1941) is
$\chi^2_{0.975} = 12.8325$.


It is seen, as expected on theoretical grounds and as evidenced in the comparisons 
of Table 1, that method 5 is the more accurate of the two. 
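The two interpolations of example (a) can be retraced numerically. The following is a minimal sketch, not part of the original computation: the helper name `interp_linear_in_U` is hypothetical, and the square roots are carried at full precision rather than rounded to three decimals, so the last digit of the second result may differ slightly from the printed 12.838.

```python
from math import sqrt

def interp_linear_in_U(U, U_lo, U_hi, f_lo, f_hi):
    """Interpolate a tabled quantity f linearly against the normal deviate U."""
    return f_lo + (f_hi - f_lo) * (U - U_lo) / (U_hi - U_lo)

# Tabled values for nu = 5, taken from the table of example (a).
U_95, U_975, U_98 = 1.6449, 1.9600, 2.0537
chi2_95, chi2_98 = 11.070, 13.388

# chi-square itself treated as linear in U.
chi2_direct = interp_linear_in_U(U_975, U_95, U_98, chi2_95, chi2_98)

# chi treated as linear in U, then squared.
chi_interp = interp_linear_in_U(U_975, U_95, U_98, sqrt(chi2_95), sqrt(chi2_98))
chi2_via_chi = chi_interp ** 2

exact = 12.8325  # value quoted in the text from Thompson (1941)
print(round(chi2_direct, 3), round(chi2_via_chi, 3))
print("errors:", round(chi2_direct - exact, 4), round(chi2_via_chi - exact, 4))
```

Interpolating on $\chi$ rather than $\chi^2$ reduces the error here by roughly a factor of five, in line with the comparison drawn in the text.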

(b) Interpolation for a $t$ level. Suppose that we calculate the value of $t$ corresponding
to $\alpha$ = 0.0125 (as defined in equation (9)), with $\nu$ = 6, using the tabled
levels for $\alpha$ = 0.005 and 0.025. The data required are shown below, the values of
$t$ being taken from the table on p. 300 of this issue.

 α        U_α       t_α       t_α/U_α   U_α²
 0.005    -2.5758   -3.7074   1.4393    6.6347
 0.0125   -2.2414   ?         ?         5.0239
 0.025    -1.9600   -2.4469   1.2484    3.8416


| 





Again, it is not necessary to calculate the standardized values of $t_\alpha$, for even
when using the relation of method 4, which assumes

$$\frac{t}{\sigma_t U} = A + BU^2, \tag{15}$$

the transference of $\sigma_t$ to the right-hand side of the equation will only modify
the constants $A$ and $B$ whose values are not directly determined in the interpolation
process.


Using method 1 ($t$ a linear function of $U$), it is found that
$t_{0.0125} = -3.023$.

Using method 4 ($t/U$ a linear function of $U^2$), it is found that
$t_{0.0125} = -2.979$.

The correct value is $-2.969$. As can be seen in Fig. 3, from the stretch of the
curve for $\nu$ = 6 between $\alpha$ = 0.975 and 0.995, the interval chosen is too long for
satisfactory linear interpolation. The use of formula (15) improves matters, but
is still not satisfactory. With Fisher's tables we could, of course, interpolate
between the levels $\alpha$ = 0.010 and 0.025; doing this with method 4, it is found that

$$t_{0.0125} = -2.971,$$


a distinctly better value. 
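The two-point calculations of example (b) can be sketched as follows. This is an illustration under stated assumptions, not the author's worksheet: `lerp` is a hypothetical helper, and method 4 is applied in the form "$t/U$ linear in $U^2$" described in the text.

```python
def lerp(x, x0, x1, y0, y1):
    """Straight-line interpolation of y at x between (x0, y0) and (x1, y1)."""
    return y0 + (y1 - y0) * (x - x0) / (x1 - x0)

# Tabled levels for nu = 6, from the table of example (b).
U_lo, t_lo = -2.5758, -3.7074   # alpha = 0.005
U_hi, t_hi = -1.9600, -2.4469   # alpha = 0.025
U = -2.2414                     # normal deviate for alpha = 0.0125

# Method 1: t taken as a linear function of U.
t_method1 = lerp(U, U_lo, U_hi, t_lo, t_hi)

# Method 4: t/U taken as a linear function of U squared.
ratio = lerp(U ** 2, U_lo ** 2, U_hi ** 2, t_lo / U_lo, t_hi / U_hi)
t_method4 = U * ratio

print(round(t_method1, 3), round(t_method4, 3))
```

Both results agree with the values quoted in the text to three decimals; the remaining gap to $-2.969$ reflects the length of the interval rather than the arithmetic.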













(c) Interpolation for a beta-distribution percentage level. Take the case $\nu_1$ = 8,
$\nu_2$ = 30 and suppose it is wished to find the 2½% level from a knowledge of the
1% and 5% levels. The data required are as follows, $v_{0.01}$ and $v_{0.05}$ being taken
from Miss Thompson's tables.

 α       U_α      v_α
 0.01    2.3263   0.54170
 0.025   1.9600   ?
 0.05    1.6449   0.62332











Then, using linear interpolation, i.e. method 3 of p. 273, we have

$$v_{0.025} = 0.62332 - (0.62332 - 0.54170)\times\frac{1.9600 - 1.6449}{2.3263 - 1.6449} = 0.58558.$$

The correct value taken from the same tables is 0.58582. It will be seen from
Fig. 4 that the intervals $\alpha$ = 0.005, 0.010, 0.025, 0.050, 0.100 of Miss Thompson's
tables are likely to form a satisfactory framework for subtabulation if this is
needed.
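Example (c) reduces to a single straight-line interpolation of $v$ against the normal deviate. A minimal sketch, with variable names chosen here for illustration only:

```python
# Tabled levels for nu1 = 8, nu2 = 30, from the table of example (c).
U_01, v_01 = 2.3263, 0.54170    # alpha = 0.01
U_05, v_05 = 1.6449, 0.62332    # alpha = 0.05
U_025 = 1.9600                  # normal deviate for alpha = 0.025

# Method 3: v taken as a linear function of U.
v_025 = v_05 + (v_01 - v_05) * (U_025 - U_05) / (U_01 - U_05)

tabled = 0.58582                # value quoted in the text
print(round(v_025, 5), round(v_025 - tabled, 5))
```

The interpolated level falls as $U$ rises from 1.6449 to 1.9600, because $v_{0.01}$ lies below $v_{0.05}$.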


8. SUBTABULATION OF EXISTING TABLES 


These methods have been used to produce the following enlargements of 
existing tables; but it is not at present possible to arrange for their publication. 

(a) Table of $\chi^2$ percentage levels. Method 7 (p. 269) was used, as it is almost as
accurate as method 8 and less laborious. The table calculated gives $\chi^2$ to 3
decimal places for $\nu$ = 1 (1) 30, and for $\alpha$ = 0.010 (0.002) 0.020; 0.020 (0.005) 0.050;
0.05 (0.01) 0.10; and for the corresponding levels at the upper end of the distribution,
i.e. for $\alpha' = 1 - \alpha$.

(b) Table of $t$ percentage levels. Method 8 (p. 271) was used for $\nu$ = 3, ..., 6, and
method 6 for $\nu$ = 7 (1) 30; 40, 60 and 120. Exact levels were calculated for $\nu$ = 1, 2.
The tables were computed to 3 decimal places and for

$\alpha$ = 0.005 (0.001) 0.010; 0.0100 (0.0025) 0.0250; 0.025 (0.005) 0.050; 0.05 (0.01) 0.10.


9. INTERPOLATION FOR THE PROBABILITY INTEGRAL $\alpha$, GIVEN $u_\alpha$


As we have already mentioned, the variable $U_\alpha$ can be expressed as a polynomial
in $u_\alpha$. This suggests that interpolation for $U_\alpha$, given $u_\alpha$, is as easy and as
accurate as the interpolation discussed above. Hence to interpolate for $\alpha$, given
the value of $u_\alpha$ and certain tabled levels $u_{\alpha_1}$, $u_{\alpha_2}$, ..., we first interpolate for $U_\alpha$
and then find the value of $\alpha$ from appropriate tables of the normal probability
integral, e.g. Tables for Statisticians and Biometricians, Part I, Table II.
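The inverse procedure can be sketched with the $\chi^2$ figures of section 7 (a). Reusing those figures, and replacing the printed table of the normal integral by Python's `statistics.NormalDist`, are assumptions made here for illustration; the text itself gives no worked example for this section.

```python
from math import sqrt
from statistics import NormalDist

# Tabled levels for chi-square with nu = 5 (section 7 (a)).
U_lo, chi_lo = 1.6449, sqrt(11.070)   # alpha = 0.95
U_hi, chi_hi = 2.0537, sqrt(13.388)   # alpha = 0.98

# Observed value whose probability integral is wanted.
chi_obs = sqrt(12.8325)

# Step 1: interpolate for U, taking U as linear in chi.
U_obs = U_lo + (U_hi - U_lo) * (chi_obs - chi_lo) / (chi_hi - chi_lo)

# Step 2: convert U to alpha through the normal probability integral.
alpha = NormalDist().cdf(U_obs)
print(round(U_obs, 4), round(alpha, 4))
```

Since 12.8325 is the tabled 0.975 level, the recovered $\alpha$ should lie very close to 0.975.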










10. CONCLUSION


It has been shown how accurate values of the probability levels of a statistical 
variate may be interpolated between standard tabled values by using the stan- 
dardized normal variate as auxiliary. For many purposes linear interpolation is 
adequate; for others a second order Lagrangian formula may be preferred. The 
accuracy of the result depends, of course, on the closeness of the actual prob- 
ability law to the normal law and on the size of the intervals between the tabled 
levels. The method has been illustrated on examples from the $\chi^2$, $t$ and $v$ (beta)
distributions.

The method has been used to provide a subtabulation of existing tables of
percentage points for $\chi^2$ and $t$, but it is not possible to have these tables printed
with the present contribution.


Finally I should like to express my thanks to Dr B. L. Welch of University 
College, London, for originally suggesting the problem to me. 


REFERENCES 
CORNISH, E. A. & FISHER, R. A. (1937). Moments and cumulants in the specification of
distributions. Rev. Inst. Int. Statist. no. 4.

FISHER, R. A. (1941). Statistical Methods for Research Workers, 8th Edition. Edinburgh:
Oliver & Boyd.

PEARSON, K. (1922). Tables of the Incomplete Gamma-Function.

PEARSON, K. (1930). Tables for Statisticians and Biometricians, Part I.

PEARSON, K. (1934). Tables of the Complete and Incomplete Beta-Function.

THOMPSON, CATHERINE M. (1941). Biometrika, 32, 168-81.
















PARTIAL RANK CORRELATION 
By M. G. KENDALL 


1. In interpreting an observed dependence between two qualities we are 
constantly faced with the question whether an association (correlation) of A 
with B is really due to the associations (correlations) of each with a third quality C. 
This has led naturally to the theories of partial association and correlation, 
which attempt to decide the matter by the consideration of subpopulations in 
which the variation of C is eliminated. An analogous problem arises in ranking 
work but, so far as I know, has not previously been considered. For example, 
if a number of men are ranked according to mathematical and musical aptitude 
and there appears a significant rank correlation, it is natural to inquire whether 
this may be attributable to the correlation of both with some more fundamental 
quality such as intelligence. The object of this paper is to propose a coefficient 
of partial rank correlation which has a natural meaning and may be found useful 
for investigations requiring this type of decision. 

2. As a preliminary it may be worth examining what can be done in this
direction with the Spearman rank correlation coefficient $\rho$. If there are three
rankings denoted by 1, 2 and 3, we may find the three coefficients $\rho_{12}$, $\rho_{13}$, $\rho_{23}$.
It is tempting to apply to these coefficients the formulae of product moment
partial correlation such as

$$\rho_{23.1} = \frac{\rho_{23} - \rho_{12}\rho_{13}}{(1-\rho_{12}^2)^{1/2}(1-\rho_{13}^2)^{1/2}} \tag{1}$$

and to define $\rho_{23.1}$ as the partial rank correlation of 2 and 3 'when 1 is constant'.
There is clearly very little justification for such a procedure, and it is far from easy
to explain just what $\rho_{23.1}$ means. In fact, the only defence of formula (1) that
can bear critical examination is, I think, that it is an approximation to a second
possibility, as follows:

3. There can be no such thing as a rank correlation in a continuous population 
(the members of which are not even denumerable) but we can speak with genuine 
meaning of a grade correlation. A well-known result due to Karl Pearson states
that in a normal bivariate population with correlation $\rho_n$, the grade correlation
$\rho_g$ is given by

$$\rho_n = 2\sin\frac{\pi}{6}\rho_g. \tag{2}$$

The Spearman coefficient $\rho$ may be regarded as a sample grade correlation. If,
therefore, we take $\rho$ as an estimate of $\rho_g$ we may find $\rho_n$ from (2). For three rankings
we may then obtain the three values of $\rho_n$, apply the ordinary product moment
partial formula, and so obtain a partial coefficient. Since $x$ and $2\sin\frac{\pi}{6}x$ do not













differ by more than a small amount for $|x| \le 1$, we might even apply formula
(1) direct to the values of $\rho$ without bothering to transform them into $\rho_n$ by
equation (2).

4. Such a procedure, again, is open to fairly obvious objections. Apart from 
the all-too-facile assumption of normality and the error involved in using Spear- 
man’s p from a small sample to estimate the grade correlation in a parent, the 
fact remains that we arrive, not at a partial rank coefficient, but at an estimate 
of a partial product moment coefficient in a normal population. 

Perhaps there are cases where this is a reasonable objective based on reason- 
able assumptions but it is evidently unsatisfactory for general ranking purposes. 

5. In a previous paper (1938) I defined an alternative coefficient of rank
correlation $\tau$ which may be generalized to include the case when pairs of individuals
are compared separately (Kendall & Babington Smith, 1940). It will
be convenient for present purposes to redefine $\tau$ in a slightly different manner
so that the results obtained below may again be immediately generalized to the
case of paired comparisons. Consider the two rankings of six

1:  1  4  3  2  6  5
2:  3  2  4  1  5  6                                                      (3)

There are $\binom{6}{2} = 15$ possible pairs of ranks in each ranking, viz. 12, 13, ..., 16, 23,


24, ..., 56. We write them down as in expression (4) below. Any order of the 
pairs will serve, and it is immaterial whether any pair is written as ab or ba; but 
for practical convenience they may be written in the natural order indicated in 
the last sentence but one. This arrangement I call the recorded order. 

We then consider the occurrence of each pair in the ranking 1. If a pair occurs
in that ranking in the order in which we have recorded it, we write a plus below
the recorded order underneath the pair concerned; in the contrary case we write
a minus. Ranking 1 of expression (3) will then give

Recorded order: (12) (13) (14) (15) (16) (23) (24) (25) (26) (34) (35) (36) (45) (46) (56)
Ranking 1:       +    +    +    +    +    -    -    +    +    -    +    +    +    +    -
                                                                           (4)

Here, for example, the pair (15) occurs in that order in ranking 1 and so is 
denoted by a +, whereas the pair (24) occurs as 42 and is denoted by a —. 

Consider now ranking 2. The members of ranking 1 which are ranked 1, 2 
correspond to members in ranking 2 ranked as 3, 1. This is in the reverse order 
to that of the pair 13 in the recorded order, so (starting a new row of signs corre- 
sponding to ranking 2) we write a minus under the recorded pair (12). Similarly, 
the pair in ranking 2 corresponding to 15 in ranking 1 is 36. This is in the same 
order as the recorded pair, so we write a plus below the existing plus under the 
recorded pair (15). The pair in ranking 2 corresponding to 23 in ranking 1 is 41. 
This is in the reverse order of the recorded pair, so we write a minus below the 







recorded pair (23) in the row of signs corresponding to ranking 2. And so on.
This takes rather a long time to explain but the process is really very simple.
The array corresponding to expressions (3) is then

Recorded order: (12) (13) (14) (15) (16) (23) (24) (25) (26) (34) (35) (36) (45) (46) (56)
Ranking 1:       +    +    +    +    +    -    -    +    +    -    +    +    +    +    -
Ranking 2:       -    +    -    +    +    -    -    +    +    +    +    +    +    +    +
                                                                           (5)

Now in expression (5) there are eleven cases in which both rankings have the
same sign and therefore 15 - 11 = 4 in which they have the opposite sign. The
coefficient $\tau$ is then given by

$$\tau = \frac{11-4}{15} = \frac{7}{15}.$$

Generally, if there are, in two rankings of $n$ arrayed as above, $S_1$ cases of the same
sign and $S_2$ of opposite sign

$$\tau = \frac{2(S_1-S_2)}{n(n-1)} = \frac{4S_1}{n(n-1)} - 1 = 1 - \frac{4S_2}{n(n-1)}. \tag{6}$$
If we arrange ranking 1 in the natural order 1, ..., $n$, then every case in which
there are the same signs in expression (5) corresponds to a case in which pairs in
ranking 2 are in the natural order; and every case of different sign to one in which
the pairs in ranking 2 are in the reverse of the natural order. The definition of (6)
thus accords with the one originally given in my 1938 paper. It is often convenient
to take one ranking to be the natural order 1, ..., $n$ so that the first row
of signs in (5) are all positive. For instance, on rearrangement of the rankings in
(3) we have

1:  1  2  3  4  5  6
2:  3  1  4  2  6  5                                                      (7)

and the array of paired comparisons becomes

Recorded order: (12) (13) (14) (15) (16) (23) (24) (25) (26) (34) (35) (36) (45) (46) (56)
Ranking 1:       +    +    +    +    +    +    +    +    +    +    +    +    +    +    +
Ranking 2:       -    +    -    +    +    +    +    +    +    -    +    +    +    +    -

Here again $S_1 = 11$, $S_2 = 4$, $\tau_{12} = \frac{7}{15}$.
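The sign-array definition of $\tau$ can be checked mechanically. In the sketch below the function names are hypothetical, the rankings are those of expression (3) read as sequences of ranks for six individuals, and pairs are enumerated by position rather than in the recorded order, which leaves the counts $S_1$ and $S_2$ unchanged.

```python
from itertools import combinations

def pair_signs(ranking):
    """One sign per pair of individuals: +1 if the first has the lower rank."""
    return [1 if ranking[i] < ranking[j] else -1
            for i, j in combinations(range(len(ranking)), 2)]

def tau_with_counts(r1, r2):
    """Return (tau, S1, S2), S1 and S2 being the numbers of like and unlike signs."""
    s1, s2 = pair_signs(r1), pair_signs(r2)
    same = sum(a == b for a, b in zip(s1, s2))
    opposite = len(s1) - same
    n = len(r1)
    return 2 * (same - opposite) / (n * (n - 1)), same, opposite

ranking1 = [1, 4, 3, 2, 6, 5]
ranking2 = [3, 2, 4, 1, 5, 6]
tau, S1, S2 = tau_with_counts(ranking1, ranking2)

# The rearranged form (7) gives the same counts.
tau_rearranged, S1_r, S2_r = tau_with_counts([1, 2, 3, 4, 5, 6], [3, 1, 4, 2, 6, 5])
print(S1, S2, tau)
```

Because only agreement of signs matters, the result does not depend on which ranking is put into the natural order.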
6. Consider now three rankings, of which the first may be taken to be the
natural order 1, ..., $n$, for example,

1:  1  2  3  4  5  6
2:  3  1  4  2  6  5
3:  4  2  1  6  3  5                                                      (9)













The corresponding array of pairs is

Recorded order: (12) (13) (14) (15) (16) (23) (24) (25) (26) (34) (35) (36) (45) (46) (56)
Ranking 1:       +    +    +    +    +    +    +    +    +    +    +    +    +    +    +
Ranking 2:       -    +    -    +    +    +    +    +    +    -    +    +    +    +    -
Ranking 3:       -    -    +    -    +    -    +    +    +    +    +    +    -    -    +
                                                                           (10)

For the coefficients $\tau$ we have

$$\tau_{12}\ (\text{as above}) = \frac{7}{15}, \qquad \tau_{13} = \frac{3}{15} = \frac{1}{5}, \qquad \tau_{23} = \frac{7-8}{15} = -\frac{1}{15},$$

in the last case $S_1$ being as usual the number of cases in which pairs in rankings
2 and 3 have the same sign.


Consider now the fourfold table setting out the occurrences of + and -
signs in the rows of expression (10) corresponding to rankings 2 and 3:

                        Ranking 2
                     +       -      Total
 Ranking 3    +      6       5       11
              -      3       1        4                                   (11)
            Total    9       6       15

Here, for example, there are six cases in which pairs of ranks have the same
(positive) sign, five in which ranking 2 is negative while ranking 3 is positive,
and so on.


Generally, if for three rankings of $n$ the table is of the form

                        Ranking 2
                     +       -      Total
 Ranking 3    +      a       b       a+b
              -      c       d       c+d                                  (12)
            Total   a+c     b+d       N







I define the partial rank correlation of 2 and 3 on 1 as

$$\tau_{23.1} = \frac{ad-bc}{\sqrt{(a+b)(c+d)(a+c)(b+d)}} \tag{13}$$

$$= \sqrt{\frac{\chi^2}{N}}, \tag{14}$$

where $N = \binom{n}{2}$ and $\chi^2$ is the ordinary mean square contingency for the fourfold
table.

In the particular case here considered the coefficient is

$$\tau_{23.1} = \frac{6-15}{\sqrt{4\cdot 11\cdot 9\cdot 6}} = -0.185,$$

as compared with $\tau_{23} = -0.067$.


In the case when ranking 1 is not the natural order 1, ..., $n$ the same principles
apply, but in considering the communalities of 2 and 3 we count as contributing
to the a-cell in (12) the pairs which are themselves of the same sign and
are also of the same sign as the term in ranking 1; and so on.
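The fourfold-table definition can be sketched in code as follows. The function names are hypothetical, and ranking 3 is taken as 4 2 1 6 3 5, which is the ranking implied by the row of signs for ranking 3 in array (10). The cell `a` counts pairs in which both rankings agree with ranking 1, `d` pairs in which both disagree, and `b`, `c` the two mixed cases; the coefficient is symmetric in `b` and `c`, so their assignment to rows or columns does not affect it.

```python
from itertools import combinations
from math import sqrt

def pair_signs(ranking):
    """One sign per pair of individuals: +1 if the first has the lower rank."""
    return [1 if ranking[i] < ranking[j] else -1
            for i, j in combinations(range(len(ranking)), 2)]

def fourfold(r1, r2, r3):
    """Cells (a, b, c, d) of the table of agreements of 2 and 3 with ranking 1."""
    a = b = c = d = 0
    for s1, s2, s3 in zip(pair_signs(r1), pair_signs(r2), pair_signs(r3)):
        agree2, agree3 = (s2 == s1), (s3 == s1)
        if agree2 and agree3:
            a += 1
        elif agree2:
            b += 1
        elif agree3:
            c += 1
        else:
            d += 1
    return a, b, c, d

def partial_tau(r1, r2, r3):
    a, b, c, d = fourfold(r1, r2, r3)
    return (a * d - b * c) / sqrt((a + b) * (c + d) * (a + c) * (b + d))

r1 = [1, 2, 3, 4, 5, 6]
r2 = [3, 1, 4, 2, 6, 5]
r3 = [4, 2, 1, 6, 3, 5]   # assumed: inferred from the signs of array (10)
a, b, c, d = fourfold(r1, r2, r3)
result = partial_tau(r1, r2, r3)
print((a, b, c, d), round(result, 3))
```

Because agreement is measured against ranking 1 pair by pair, the same function applies unchanged when ranking 1 is not in the natural order.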

7. The partial $\tau$ defined in equation (13) is a coefficient of association in
the 2 x 2 table suggested by Yule (1912). When the attributes of the table are
independent, and only in this case, it is zero. It is +1 only if $b$ and $c$ are both zero
(i.e. if the two rankings agree in all pairs and hence are identical) and -1 only
if $a$ and $d$ are both zero (i.e. if the two rankings disagree in all pairs and one is
the reverse of the other). In this latter property of attaining unity only when
diagonally opposite cells are both empty it differs from two other coefficients
proposed by Yule.*

Thus partial $\tau$ as defined is a measure of the association of agreements of the
rankings 2 and 3 when compared in pairs with ranking 1. From this viewpoint
it will be seen that the use of the word 'partial' conforms to that in the theories
of association and correlation. The partial associations of $A$ and $B$ in a third
population containing $C$ and $\gamma$ are those of $AC$ and $BC$ or of $A\gamma$ and $B\gamma$. The
partial association of ranks is that of pairs of agreements of 21 and 31. The partial
correlation of 2 and 3 when 1 is constant is paralleled by the partial rank correlation
of 2 and 3 when 1 is in the natural order 1, ..., $n$. If partial $\tau$ is unity in
absolute value the rankings coincide or one is the reverse of the other, whether
they agree with ranking 1 or not, so that the coefficient fulfils its proper function
of measuring the relationship between 2 and 3 independently of the influence of 1.

* Yule himself arrived at the coefficient by considering product-moments in a 2x2 table. 
Karl Pearson & Heron (1913) mistook Yule’s intention and thought the coefficient was proposed 
as an estimate of the correlation in a normal population whose frequencies were given by a double 
dichotomy in the 2 x 2 table. Their long memoir is mainly devoted to advocating the alternative 
merits of tetrachoric r. The two things, as Yule has emphasized, are quite different. I mention 
the point to make it clear that the Pearson-Heron criticisms of Yule’s coefficient, even if not mis- 


founded, do not affect the above work, since I use the coefficient purely as a measure of association 
in the fourfold table. 










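8. We may establish the remarkable result

$$\tau_{23.1} = \frac{\tau_{23} - \tau_{12}\tau_{13}}{(1-\tau_{12}^2)^{1/2}(1-\tau_{13}^2)^{1/2}}. \tag{15}$$

In fact, from expression (12) we see that

$$\tau_{12} = \frac{(a+b)-(c+d)}{N}, \qquad \tau_{13} = \frac{(a+c)-(b+d)}{N}, \qquad \tau_{23} = \frac{(a+d)-(b+c)}{N}.$$

Remembering that $N = a+b+c+d$ we have

$$\tau_{23} - \tau_{12}\tau_{13} = \frac{1}{N^2}\Big[(a+b+c+d)\{(a+d)-(b+c)\} - \{(a+b)-(c+d)\}\{(a+c)-(b+d)\}\Big] = \frac{4}{N^2}(ad-bc).$$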

Equation (15) follows at once from the definition of partial $\tau$ in equation (13).
The appearance of the product-moment type of relation between total and
partial correlations is surprising. There was no reason to expect that partial $\tau$,
which is a pure function of disarrangements in rankings and is not expressible
algebraically in terms of the ranks, should bear any analogy with the partial
correlation of variates; but since it does so, we are evidently fortified in regarding
partial $\tau$ as a convenient measure of rank correlation.
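The step from the expansion of $\tau_{23} - \tau_{12}\tau_{13}$ to equation (15) can be written out in full. The lines below are a supplementary derivation in the notation of table (12), assuming $\tau_{12}$ and $\tau_{13}$ are the two marginal differences divided by $N$; since they enter (15) symmetrically, it does not matter which margin belongs to which.

```latex
\begin{align*}
1-\tau_{12}^2 &= \frac{N^2-\{(a+b)-(c+d)\}^2}{N^2}
              = \frac{4(a+b)(c+d)}{N^2}, \\
1-\tau_{13}^2 &= \frac{N^2-\{(a+c)-(b+d)\}^2}{N^2}
              = \frac{4(a+c)(b+d)}{N^2}, \\
\frac{\tau_{23}-\tau_{12}\tau_{13}}
     {(1-\tau_{12}^2)^{1/2}(1-\tau_{13}^2)^{1/2}}
  &= \frac{4(ad-bc)/N^2}{4\sqrt{(a+b)(c+d)(a+c)(b+d)}/N^2}
   = \frac{ad-bc}{\sqrt{(a+b)(c+d)(a+c)(b+d)}} .
\end{align*}
```

Each of the first two lines uses $N = (a+b)+(c+d) = (a+c)+(b+d)$ together with the identity $(p+q)^2-(p-q)^2 = 4pq$.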


Example. Ten men are ranked according to (1) intelligence, (2) mathematical ability,
(3) musical ability. The rankings are:

1:   1   2   3   4   5   6   7   8   9  10
2:   1   4   5   6   2   7   3   9   8  10
3:   4   1   3   5   2   6   7  10   9   8

It will be found that $\tau_{12} = 0.644$, $\tau_{13} = 0.644$, $\tau_{23} = 0.555$. Thus mathematical and
musical ability are positively correlated. The question is, can this correlation be attributed
to the correlation of both with intelligence?

We find

$$\tau_{23.1} = \frac{0.555 - (0.644)^2}{1 - (0.644)^2} = 0.24.$$


The conclusion would be that although part of the total correlation is due to the correlation 
of both with intelligence, part of it is not. We cannot attribute the whole of the observed 
(total) correlation between mathematical and musical ability to the interference of common 


correlation with intelligence. 
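The example can be recomputed both from equation (15) and directly from the fourfold table, which also gives a numerical check of the identity. This is a sketch under stated assumptions: the function names are hypothetical, and the three rankings are taken as 1 to 10 in natural order, 1 4 5 6 2 7 3 9 8 10 and 4 1 3 5 2 6 7 10 9 8, a reading of the printed table that reproduces the quoted coefficients.

```python
from itertools import combinations
from math import sqrt

def pair_signs(ranking):
    """One sign per pair of individuals: +1 if the first has the lower rank."""
    return [1 if ranking[i] < ranking[j] else -1
            for i, j in combinations(range(len(ranking)), 2)]

def tau(ra, rb):
    """Rank correlation from the numbers of like and unlike signs."""
    sa, sb = pair_signs(ra), pair_signs(rb)
    same = sum(x == y for x, y in zip(sa, sb))
    return (2 * same - len(sa)) / len(sa)

def partial_from_taus(t12, t13, t23):
    """Partial coefficient by the product-moment type formula (15)."""
    return (t23 - t12 * t13) / sqrt((1 - t12 ** 2) * (1 - t13 ** 2))

def partial_from_table(r1, r2, r3):
    """Partial coefficient from the fourfold table, as in equation (13)."""
    a = b = c = d = 0
    for s1, s2, s3 in zip(pair_signs(r1), pair_signs(r2), pair_signs(r3)):
        agree2, agree3 = (s2 == s1), (s3 == s1)
        if agree2 and agree3:
            a += 1
        elif agree2:
            b += 1
        elif agree3:
            c += 1
        else:
            d += 1
    return (a * d - b * c) / sqrt((a + b) * (c + d) * (a + c) * (b + d))

intelligence = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
mathematics = [1, 4, 5, 6, 2, 7, 3, 9, 8, 10]
music = [4, 1, 3, 5, 2, 6, 7, 10, 9, 8]

t12 = tau(intelligence, mathematics)
t13 = tau(intelligence, music)
t23 = tau(mathematics, music)
via_formula = partial_from_taus(t12, t13, t23)
via_table = partial_from_table(intelligence, mathematics, music)
print(round(t12, 3), round(t13, 3), round(t23, 3), round(via_formula, 2))
```

The two routes agree to rounding error, and $\tau_{23}$ comes out as 25/45, which the text gives truncated as 0.555.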







9. The same methods are immediately capable of extension to paired comparisons.
In fact the array of type (10) is the array of paired comparisons for all
the possible pairs of ranks and when there are no constraints of the ranking
character (i.e. such that if $A \to B$ and $B \to C$ then must $A \to C$) the coefficient $\tau$
becomes a measure of agreement in paired comparisons (cf. Kendall & Babington
Smith, 1940). We could then construct measures of partial agreement by the
same formulae in cases where it was suspected that there were mutual influences
at work between three observers, as for instance if it were suspected that community
of preference between two children was due to community of both with
one of the parents.

10. In conclusion, it may perhaps be desirable to point out that although
partial $\tau$ is defined by equation (14) in terms of $\chi^2$ for a fourfold table, its significance
cannot on that account be tested in the Type III $\chi^2$ distribution with one
degree of freedom. I have not yet succeeded in finding expressions for the sampling
distribution of partial $\tau$ but it seems clear that the Type III distribution will not
be reproduced in the ranking case, at least without some substantial modification,
even when the rankings are independent; for there exist correlations between the
signs given by any ranking in the recorded order. If, for example, (12) and (23)
are positive so must be (13), whereas if (13) and (23) are positive (12) may be
either positive or negative. The units in the fourfold table cannot therefore be
regarded as allocated at random and the Type III distribution will probably not
hold. I hope to return to this subject on a later occasion.


REFERENCES 


KENDALL, M. G. (1938). A new measure of rank correlation. Biometrika, 30, 81.

KENDALL, M. G. & BABINGTON SMITH, B. (1940). On the method of paired comparisons.
Biometrika, 31, 324.

PEARSON, KARL & HERON, D. (1913). On theories of association. Biometrika, 9, 159.

YULE, G. UDNY (1912). On the methods of measuring the association between two attributes.
J.R. Statist. Soc. 75, 579.













INEQUALITIES FOR MULTIVARIATE FREQUENCY 
DISTRIBUTIONS 


By C. E. V. LESER 


GIVEN a frequency distribution $y$ of a single variate $x$, with arithmetic mean
$\bar{x} = 0$ and with standard deviation $\sigma$, Tchebycheff's (1867) well-known inequality
presents a lower limit for the ratio of the frequency of all values of $x$ between
$-\lambda\sigma$ and $\lambda\sigma$ to the total frequency, where $\lambda \ge 1$. In the case of the special class
of frequency distributions for which $y(x) + y(-x)$ is a non-increasing function of
$|x|$, this inequality can be substantially improved to another limit which applies
to all positive values of $\lambda$, as Gauss (1823) has already proved. Various authors*
have generalized these theorems by modifying the assumptions made with
regard to the frequency distribution, by introducing moments of higher order
than the second or by extending some of the results to bivariate functions. In
the present analysis only moments of second order are used, but the results apply
to frequency functions of any number of variates.

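Suppose $y(x_1, \dots, x_n)$ to be a frequency distribution of $n$ variates, with total
frequency equal to unity, arithmetic mean at the origin and with standard
deviations $\sigma_1, \dots, \sigma_n$. Let $P$ be the frequency of all combinations of $x_1, \dots, x_n$
for which

$$\frac{x_1^2}{\lambda_1^2\sigma_1^2} + \dots + \frac{x_n^2}{\lambda_n^2\sigma_n^2} \le n.$$

Write

$$\lambda_0 = \sqrt{\frac{n}{1/\lambda_1^2 + \dots + 1/\lambda_n^2}}, \qquad \sigma_0 = \sqrt{\frac{n}{1/\sigma_1^2 + \dots + 1/\sigma_n^2}},$$

so that $\lambda_0^2$ and $\sigma_0^2$ are the harmonic averages of $\lambda_1^2, \dots, \lambda_n^2$ and $\sigma_1^2, \dots, \sigma_n^2$ respectively.
We also write

$$\mu_i = \frac{\lambda_i}{\lambda_0}, \qquad \tau_i = \frac{\sigma_i}{\sigma_0} \qquad (i = 1, \dots, n),$$

$$R = \sqrt{\left(\frac{x_1}{\mu_1\tau_1}\right)^2 + \dots + \left(\frac{x_n}{\mu_n\tau_n}\right)^2} = \lambda_0\sigma_0\sqrt{\left(\frac{x_1}{\lambda_1\sigma_1}\right)^2 + \dots + \left(\frac{x_n}{\lambda_n\sigma_n}\right)^2},$$

so that $P$ is the frequency of all values of $x_1, \dots, x_n$ for which $R/\sqrt{n} \le \lambda_0\sigma_0$. Furthermore,
we define $A(R_0)$ as the average value of $y$ for all those values of $x_1, \dots, x_n$
for which $R$ has a fixed value $R_0$, and therefore

$$A(R_0) = \frac{\displaystyle\int\!\cdots\!\int_{R=R_0} y\,dx_1 \cdots dx_n}{\displaystyle\int\!\cdots\!\int_{R=R_0} dx_1 \cdots dx_n}.$$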


* E.g. K. Pearson (1919), B. H. Camp (1922), S. Narumi (1923), C. D. Smith (1930), P. O. Berge 
(1937). 




We can now state the following: 

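THEOREM. Assuming the frequency distribution to be such that $A(R)$ is a
non-increasing function of $R$ for $R/\sqrt{n} \le \kappa\sigma_0$, we have one of the following three
sets of inequalities, according to the value of $\kappa$:

I. $\kappa \le 1$.
   (i) $\lambda_0 \le 1$:  $P \ge 0$,
   (ii) $1 \le \lambda_0$:  $P \ge 1 - \dfrac{1}{\lambda_0^2}$.

II. $1 \le \kappa \le \left(\dfrac{n+2}{n}\right)^{1/2}$.
   (i) $\lambda_0 \le \left(\dfrac{2}{n+2}\right)^{1/n}\kappa$:  $P \ge \dfrac{n+2}{2}\left(1 - \dfrac{1}{\kappa^2}\right)\dfrac{\lambda_0^n}{\kappa^n}$,
   (ii) $\left(\dfrac{2}{n+2}\right)^{1/n}\kappa \le \lambda_0 \le \kappa$:  $P \ge 1 - \dfrac{1}{\kappa^2}$,
   (iii) $\kappa \le \lambda_0$:  $P \ge 1 - \dfrac{1}{\lambda_0^2}$.

III. $\left(\dfrac{n+2}{n}\right)^{1/2} \le \kappa$.
   (i) $\lambda_0 \le \left(\dfrac{2}{n+2}\right)^{1/n}\left(\dfrac{n+2}{n}\right)^{1/2}$:  $P \ge \left(\dfrac{n}{n+2}\right)^{n/2}\lambda_0^n$,
   (ii) $\left(\dfrac{2}{n+2}\right)^{1/n}\left(\dfrac{n+2}{n}\right)^{1/2} \le \lambda_0 \le \left(\dfrac{2}{n+2}\right)^{1/n}\kappa$:  $P \ge 1 - \left(\dfrac{2}{n+2}\right)^{2/n}\dfrac{1}{\lambda_0^2}$,
   (iii) $\left(\dfrac{2}{n+2}\right)^{1/n}\kappa \le \lambda_0 \le \kappa$:  $P \ge 1 - \dfrac{1}{\kappa^2}$,
   (iv) $\kappa \le \lambda_0$:  $P \ge 1 - \dfrac{1}{\lambda_0^2}$.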
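The case structure of the theorem can be sketched as a single function. This is an illustration only: the function name is hypothetical, and it assumes the theorem is read as sets I to III with the boundaries $(2/(n+2))^{1/n}$ and $((n+2)/n)^{1/2}$, the limit for $\lambda_0 \ge \kappa$ being $1 - 1/\lambda_0^2$ (or zero when that is negative). For $n = 1$ and large $\kappa$ it should reduce to Gauss's limits, and for $\kappa$ below $\lambda_0$ to Tchebycheff's.

```python
def lower_limit(n, lam0, kappa):
    """Lower limit for P under the reading of the theorem described above."""
    if lam0 >= kappa:                      # I (ii), II (iii), III (iv); also I (i)
        return max(0.0, 1 - 1 / lam0 ** 2)
    if kappa <= 1:                         # I (i) with lam0 < kappa
        return 0.0
    c = (2 / (n + 2)) ** (1 / n)
    if lam0 >= c * kappa:                  # II (ii) and III (iii)
        return 1 - 1 / kappa ** 2
    split = ((n + 2) / n) ** 0.5
    if kappa <= split:                     # II (i)
        return (n + 2) / 2 * (1 - 1 / kappa ** 2) * (lam0 / kappa) ** n
    if lam0 <= c * split:                  # III (i)
        return (n / (n + 2)) ** (n / 2) * lam0 ** n
    return 1 - c ** 2 / lam0 ** 2          # III (ii)

# One variate, monotonicity assumed everywhere (kappa very large): Gauss's limits.
gauss_small = lower_limit(1, 1.0, 1e6)     # lambda / sqrt(3)
gauss_large = lower_limit(1, 2.0, 1e6)     # 1 - 4 / (9 lambda^2)

# No monotonicity assumed beyond the region considered: Tchebycheff's limit.
tcheb = lower_limit(3, 2.0, 0.5)           # 1 - 1 / lambda^2
print(gauss_small, gauss_large, tcheb)
```

The limit is continuous across the case boundaries, which gives a simple internal check on the thresholds used.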


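Proof. We introduce the new system of co-ordinates

$$\begin{aligned}
x_1 &= \mu_1\tau_1 R\cos t_1\cos t_2\cos t_3 \cdots \cos t_{n-1},\\
x_2 &= \mu_2\tau_2 R\sin t_1\cos t_2\cos t_3 \cdots \cos t_{n-1},\\
x_3 &= \mu_3\tau_3 R\sin t_2\cos t_3 \cdots \cos t_{n-1},\\
&\;\;\vdots\\
x_n &= \mu_n\tau_n R\sin t_{n-1},
\end{aligned}$$

so that

$$y(x_1, \dots, x_n)\,dx_1 \cdots dx_n = z(R, t_1, \dots, t_{n-1})\,R^{n-1}\,dR\,dt_1 \cdots dt_{n-1},$$

where

$$z(R, t_1, \dots, t_{n-1}) = y(R, t_1, \dots, t_{n-1})\cos t_2\cos^2 t_3 \cdots \cos^{n-2} t_{n-1}.$$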





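We also write $s = \sigma_0\sqrt{n}$, and we are going to use the following abbreviating symbol

$$\int_T f(t)\,dt = \int_{t_1=0}^{2\pi}\int_{t_2=-\pi/2}^{\pi/2} \cdots \int_{t_{n-1}=-\pi/2}^{\pi/2} f(t_1, \dots, t_{n-1})\,dt_1 \cdots dt_{n-1}.$$

Then

$$\int_{R=0}^{\infty}\int_T R^{n-1}z(R,t)\,dR\,dt = 1, \qquad \int_{R=0}^{\lambda_0 s}\int_T R^{n-1}z(R,t)\,dR\,dt = P,$$

and as $A(R) = \text{const.}\int_T z(R,t)\,dt$, we have the condition that $\int_T z(R,t)\,dt$ is a
non-increasing function of $R$ for $R \le \kappa s$. Furthermore, let us write

$$G = \int_T z(\lambda_0 s, t)\,dt,$$

and later on $u = \dfrac{G}{n}s^n$.

Now start from the equation

$$\int_{x_1=-\infty}^{\infty} \cdots \int_{x_n=-\infty}^{\infty}\left[\left(\frac{x_1}{\mu_1\tau_1}\right)^2 + \dots + \left(\frac{x_n}{\mu_n\tau_n}\right)^2\right] y\,dx_1 \cdots dx_n = \left(\frac{\sigma_1}{\mu_1\tau_1}\right)^2 + \dots + \left(\frac{\sigma_n}{\mu_n\tau_n}\right)^2,$$

which can be written

$$(\lambda_0\sigma_0)^2\left(\frac{1}{\lambda_1^2} + \dots + \frac{1}{\lambda_n^2}\right) = \int_{R=0}^{\infty}\int_T R^{n+1}z(R,t)\,dR\,dt$$

or $s^2 = I_1 + I_2$, where

$$I_1 = \int_{R=0}^{\lambda_0 s}\int_T R^{n+1}z(R,t)\,dR\,dt, \qquad I_2 = \int_{R=\lambda_0 s}^{\infty}\int_T R^{n+1}z(R,t)\,dR\,dt.$$

According to the value of $\kappa s$, three cases have to be distinguished.

(a) $\kappa s \ge \left[(\lambda_0 s)^n + \dfrac{n}{G}(1-P)\right]^{1/n}$ or $P \ge 1 - (\kappa^n - \lambda_0^n)u$.

For $R \le \lambda_0 s$, $\int_T z(R,t)\,dt \ge G$. Hence

$$I_1 \ge G\int_{R=0}^{\lambda_0 s} R^{n+1}\,dR,$$

and incidentally

$$P \ge G\int_{R=0}^{\lambda_0 s} R^{n-1}\,dR = \lambda_0^n u.$$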



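To obtain a limit for $I_2$, we make use of the equation

$$\int_{R=\lambda_0 s}^{\infty}\int_T R^{n-1}z(R,t)\,dR\,dt = 1 - P = G\int_{R=\lambda_0 s}^{\left[(\lambda_0 s)^n + \frac{n}{G}(1-P)\right]^{1/n}} R^{n-1}\,dR$$

or

$$\int_{R=\left[(\lambda_0 s)^n + \frac{n}{G}(1-P)\right]^{1/n}}^{\infty}\int_T R^{n-1}z(R,t)\,dR\,dt = \int_{R=\lambda_0 s}^{\left[(\lambda_0 s)^n + \frac{n}{G}(1-P)\right]^{1/n}} R^{n-1}\left[G - \int_T z(R,t)\,dt\right]dR.$$

As the integrands are nowhere negative and $R$ nowhere smaller in the integral
on the left than in the integral on the right, we can write

$$\int_{R=\left[(\lambda_0 s)^n + \frac{n}{G}(1-P)\right]^{1/n}}^{\infty}\int_T R^{n+1}z(R,t)\,dR\,dt \ge \int_{R=\lambda_0 s}^{\left[(\lambda_0 s)^n + \frac{n}{G}(1-P)\right]^{1/n}} R^{n+1}\left[G - \int_T z(R,t)\,dt\right]dR,$$

$$I_2 \ge G\int_{R=\lambda_0 s}^{\left[(\lambda_0 s)^n + \frac{n}{G}(1-P)\right]^{1/n}} R^{n+1}\,dR,$$

and therefore

$$s^2 \ge G\int_{R=0}^{\left[(\lambda_0 s)^n + \frac{n}{G}(1-P)\right]^{1/n}} R^{n+1}\,dR = \frac{G}{n+2}\left[(\lambda_0 s)^n + \frac{n}{G}(1-P)\right]^{(n+2)/n},$$

$$n^{2/n}(n+2)\,u^{2/n} \ge \left[n(\lambda_0^n u + 1 - P)\right]^{(n+2)/n},$$

$$P \ge 1 + \lambda_0^n u - \left(\frac{n+2}{n}\right)^{n/(n+2)} u^{2/(n+2)}.$$

(b) $\lambda_0 s \le \kappa s \le \left[(\lambda_0 s)^n + \dfrac{n}{G}(1-P)\right]^{1/n}$ or $P \le 1 - (\kappa^n - \lambda_0^n)u$, $\lambda_0 \le \kappa$.

As before, $I_1 \ge G\displaystyle\int_{R=0}^{\lambda_0 s} R^{n+1}\,dR$ and $P \ge \lambda_0^n u$.

Furthermore, we have the equation

$$\int_{R=\lambda_0 s}^{\infty}\int_T R^{n-1}z(R,t)\,dR\,dt = 1 - P = G\int_{R=\lambda_0 s}^{\kappa s} R^{n-1}\,dR + \left[1 - P - \frac{G}{n}(\kappa^n - \lambda_0^n)s^n\right],$$

$$\int_{R=\kappa s}^{\infty}\int_T R^{n-1}z(R,t)\,dR\,dt = \int_{R=\lambda_0 s}^{\kappa s} R^{n-1}\left[G - \int_T z(R,t)\,dt\right]dR + \left[1 - P - \frac{G}{n}(\kappa^n - \lambda_0^n)s^n\right],$$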





288 Inequalities for multivariate frequency distributions 


and, as before, it follows that 


% [, R+12(R, naRatz |" ™ res| G—| (, dat |aR 
R=ks R=A 8 ? 
+ («s)8| 1 P— Cer agya |, 


Ks a 
126] R’1d R + (xs)? [1-P-F (x"—Aj)s dy 


R=A 9s 


Hence #26 |" ReaR + (xs)¢| 1- PF ~(K"—A?')s |. 


R=0 


n . . : 
12 5 k"ttu + K*[1— P—(x" —AG) u], 


N+ 2 
1) 2 
P2(1-2)+(as- — x) 
ey n+2 
(c) K8SSAgs or Aj2zk. 


In this case, $I_1$ need not be larger than 0, but
$$I_2 \ge (\lambda_0 s)^2(1-P),$$
$$s^2 \ge (\lambda_0 s)^2(1-P).$$
Therefore, if $\lambda_0 \ge 1$, $P \ge 1 - \dfrac{1}{\lambda_0^2}$, and if $\lambda_0 < 1$, no significant limit exists.


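For $\lambda_0 \ge \kappa$, the problem has thus been solved, but the case $\lambda_0 < \kappa$ is more complicated; summarizing the results of (a) and (b), we have two alternative sets of inequalities. Write
$$f_1(u) = \lambda_0^n u, \qquad f_2(u) = 1 - (\kappa^n - \lambda_0^n)u,$$
$$f_3(u) = 1 + \lambda_0^n u - \left(\frac{n+2}{n}\right)^{n/(n+2)} u^{2/(n+2)},$$
$$f_4(u) = \left(1 - \frac{1}{\kappa^2}\right) + \left(\lambda_0^n - \frac{2}{n+2}\,\kappa^n\right)u.$$
Then either $P \ge f_1$, $P \ge f_2$, $P \ge f_3$,
or $P \ge f_1$, $P \le f_2$, $P \ge f_4$.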


For different frequency distributions, G and therefore u may assume any non-negative value which is compatible with the condition $P \le 1$. We have to find the effective lower limit for P as a function of u and its minimum value which, as easily seen, is only larger than 0, if $f_4(0) > 0$ or, what is the same thing, $\kappa > 1$.

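$f_1$, $f_2$, $f_4$ are straight lines. As
$$\frac{df_3}{du} = \lambda_0^n - \frac{2}{n+2}\left(\frac{n+2}{n}\right)^{n/(n+2)} u^{-n/(n+2)},$$
$f_3$ has a minimum when
$$u = u_3^{(\min.)} = \frac{n+2}{n}\left(\frac{2}{n+2}\right)^{(n+2)/n}\frac{1}{\lambda_0^{n+2}} = \frac{2}{n}\left(\frac{2}{n+2}\right)^{2/n}\frac{1}{\lambda_0^{n+2}},$$
and
$$f_3^{(\min.)} = 1 + \frac{2}{n}\left(\frac{2}{n+2}\right)^{2/n}\frac{1}{\lambda_0^2} - \frac{n+2}{n}\left(\frac{2}{n+2}\right)^{2/n}\frac{1}{\lambda_0^2} = 1 - \left(\frac{2}{n+2}\right)^{2/n}\frac{1}{\lambda_0^2}.$$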


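Indicating by $u_{ij}$, $f_{ij}$ the co-ordinates of the point in which the curves $f_i$ and $f_j$ intersect, we find that
$$u_{12} = \frac{1}{\kappa^n}, \qquad f_{12} = \left(\frac{\lambda_0}{\kappa}\right)^n,$$
$$u_{13} = \left(\frac{n}{n+2}\right)^{n/2}, \qquad f_{13} = \left(\frac{n}{n+2}\right)^{n/2}\lambda_0^n,$$
$$u_{14} = \frac{n+2}{2}\left(1 - \frac{1}{\kappa^2}\right)\frac{1}{\kappa^n}, \qquad f_{14} = \frac{n+2}{2}\left(1 - \frac{1}{\kappa^2}\right)\left(\frac{\lambda_0}{\kappa}\right)^n.$$
$f_2$, $f_3$ and $f_4$ have a point $(u_{234}, f_{234})$ in common in which $f_4$ is the tangent to $f_3$:
$$u_{234} = \frac{n+2}{n}\,\frac{1}{\kappa^{n+2}}, \qquad f_{234} = 1 - \frac{n+2}{n}\,\frac{\kappa^n - \lambda_0^n}{\kappa^{n+2}}.$$
It is also seen that the sign of both expressions $\dfrac{df_4}{du}$ and $u_{234} - u_3^{(\min.)}$ is positive or negative according to whether $\lambda_0 \gtrless \left(\dfrac{2}{n+2}\right)^{1/n}\kappa$.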


We have therefore four possibilities. In the following diagrams, the heavy line shows the effective lower limit for P as a function of u (the upper limit is always equal to 1, of course), and its minimum gives the general lower limit for P.

[Diagrams: Cases 1 to 4, each showing the limits $f_1$ to $f_4$ plotted against u.]

In these four cases, the following results are obtained:


(1) Ug4 = U9, P2fis 











(2) | so] — { oe ug 2 ths Poh 
(3) du yfnin.) < U3 P 2 fy 
df, 
4 v4>9Q P=f,(0), 
(4) at =fx(0) 
and by substituting the proper values: 
y g prop 
_ [n+2\ 2 \ln n+2 1\ (Ay 
() 1ses(*=) MS(a5a) P2*>(!-al(Z 
2 jn n+ 2 4 n tn 
‘ ~A, s|— ——. P2=|——)} A? 
2)| 2 (2+) 05(; 3} | n 2 (573) ; 
> K2 P 
= 6 In 9 1j/n 9\4 x. 2/n | 
3 | * 2 . n+2 ee 
a n+2 R22 n+2, n ) sili m+2) 22? 
) ijn a 
(@)eBL (Toa) “esas Pel-3 


In addition, we know that for $\lambda_0 \ge \kappa$, $\lambda_0 \ge 1$: $P \ge 1 - \dfrac{1}{\lambda_0^2}$. By rearranging these inequalities, we can bring them into the form in which they were given in the theorem, which is therefore proved.

It may be remarked that $\kappa$ depends on the ratios between any two of the quantities $\lambda_1, \ldots, \lambda_n$, but it remains the same if $\lambda_1, \ldots, \lambda_n$ change in the same proportion.

Let us consider the sets of inequalities I, II and III of the theorem separately. The most important case in which set I is relevant occurs when nothing about the frequency distribution is known except the averages and standard deviations of the variables, in which case we have to put $\kappa = 0$. It provides a generalization of Tchebycheff's inequality which is obtained in the special case n = 1 in which $\lambda_0 = \lambda$.

If n = 2, the inequality refers to the frequency of all points lying inside the ellipse which has the axes $\lambda_1\sqrt2\,\sigma_1$, $\lambda_2\sqrt2\,\sigma_2$ and it can be written in the following way:
$$P \ge 1 - \frac12\left(\frac{1}{\lambda_1^2} + \frac{1}{\lambda_2^2}\right).$$
It is interesting to compare this limit with the one given by Berge (1937) for a rectangle which has its corners in the points with the co-ordinates $(\pm\lambda\sigma_1, \pm\lambda\sigma_2)$:
$$P \ge 1 - \frac{1 + \sqrt{(1-r^2)}}{\lambda^2},$$
where r is the correlation coefficient of the frequency distribution. If r = 0, Berge's limit equals $1 - 2/\lambda^2$ and is therefore equal to the limit we get for the ellipse with the axes $\lambda\sigma_1$, $\lambda\sigma_2$ which is inscribed to the rectangle. On the other hand, if $r = \pm 1$, Berge's limit becomes equal to $1 - 1/\lambda^2$, and we have to choose the circumscribed ellipse with the axes $\lambda\sqrt2\,\sigma_1$, $\lambda\sqrt2\,\sigma_2$ to obtain the same limit. Hence, the limit given here is certainly not inferior to the limit given by Berge, if the two variates are independent, but it is not superior to this limit, if there is a perfect correlation.
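The comparison above can be checked numerically. The following is a minimal sketch, not part of the original paper: it evaluates the ellipse limit $1 - \frac12(1/\lambda_1^2 + 1/\lambda_2^2)$ and the two special values of Berge's limit quoted in the text, and then estimates by simulation the frequency inside the ellipse for a correlated normal pair. It assumes that "axes" means semi-axes, so that the ellipse is $x_1^2/(2\lambda_1^2\sigma_1^2) + x_2^2/(2\lambda_2^2\sigma_2^2) \le 1$; the function names and the choice of a normal distribution with unit standard deviations are illustrative only.

```python
import math
import random


def ellipse_limit(lam1: float, lam2: float) -> float:
    """Lower limit for the frequency inside the ellipse (set I, n = 2)."""
    return 1.0 - 0.5 * (1.0 / lam1 ** 2 + 1.0 / lam2 ** 2)


def berge_limit_uncorrelated(lam: float) -> float:
    """Berge's rectangle limit in the special case r = 0, as quoted in the text."""
    return 1.0 - 2.0 / lam ** 2


def berge_limit_perfect_correlation(lam: float) -> float:
    """Berge's rectangle limit in the special case r = +1 or -1."""
    return 1.0 - 1.0 / lam ** 2


def empirical_frequency(lam1, lam2, r, trials=20000, seed=1):
    """Share of simulated points inside the ellipse (assumed semi-axes lam*sqrt(2)*sigma).

    The pair is standard normal with correlation r; this distribution is an
    assumption made only for the illustration.
    """
    rng = random.Random(seed)
    inside = 0
    for _ in range(trials):
        z1, z2 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        x1 = z1
        x2 = r * z1 + math.sqrt(1.0 - r * r) * z2
        if x1 * x1 / (2.0 * lam1 ** 2) + x2 * x2 / (2.0 * lam2 ** 2) <= 1.0:
            inside += 1
    return inside / trials


if __name__ == "__main__":
    lam = 2.0
    # Inscribed ellipse (axes lam*sigma) against Berge's limit for r = 0.
    print(ellipse_limit(lam / math.sqrt(2), lam / math.sqrt(2)), berge_limit_uncorrelated(lam))
    # Circumscribed ellipse (axes lam*sqrt(2)*sigma) against Berge's limit for r = 1.
    print(ellipse_limit(lam, lam), berge_limit_perfect_correlation(lam))
    print(empirical_frequency(1.5, 1.5, 0.6), ellipse_limit(1.5, 1.5))
```

Since the limit is distribution-free, the simulated frequency for any particular distribution should lie above it, usually by a wide margin.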

Set II is perhaps of more theoretical than practical interest. An intermediate value of $\kappa$ may, however, be realized, if there is sufficient information about the frequencies inside, but not outside, a certain n-dimensional interval. The inequalities correspond to those given by Narumi (1923) for frequency distributions of one variate only, but having a more general meaning in so far as P may refer to multiples of other quantities besides the standard deviation. The first inequality seems the most interesting one of this set; it reduces to a special case of Narumi's inequality if we put n = 1:
$$\lambda \le \tfrac23\kappa : P \ge \frac32\left(1 - \frac{1}{\kappa^2}\right)\frac{\lambda}{\kappa},$$
and for n = 2 it may be written
$$\lambda_0 \le \frac{\kappa}{\sqrt2} : P \ge 2\left(1 - \frac{1}{\kappa^2}\right)\left(\frac{\lambda_0}{\kappa}\right)^2$$
or
$$\frac{1}{\lambda_1^2} + \frac{1}{\lambda_2^2} \ge \frac{4}{\kappa^2} : P \ge \frac{4\left(1 - \dfrac{1}{\kappa^2}\right)}{\kappa^2\left(1/\lambda_1^2 + 1/\lambda_2^2\right)}.$$

Set III generalizes Gauss's inequalities. It reduces to the first two inequalities for those values of $\lambda_1, \ldots, \lambda_n$ (if any) for which $\kappa = \infty$. This is the case for all values of $\lambda_1, \ldots, \lambda_n$, if $y(hx_1, \ldots, hx_n) + y(-hx_1, \ldots, -hx_n)$ is a non-increasing function of $|h|$ for any fixed values of $x_1, \ldots, x_n$; especially if the function y decreases monotonically along every straight line radiating from the origin.

The assumption $\kappa = \infty$ will be made throughout the following analysis.
Gauss's formulae are obtained by substituting n = 1:
$$\lambda \le \frac{2}{\sqrt3} : P \ge \frac{\lambda}{\sqrt3},$$
$$\lambda \ge \frac{2}{\sqrt3} : P \ge 1 - \frac{4}{9\lambda^2}.$$
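In the case n = 2, the inequalities are also greatly simplified:
$$\lambda_0 \le 1 : P \ge \frac{\lambda_0^2}{2}, \qquad \lambda_0 \ge 1 : P \ge 1 - \frac{1}{2\lambda_0^2},$$
or
$$\frac{1}{\lambda_1^2} + \frac{1}{\lambda_2^2} \ge 2 : P \ge \frac{1}{1/\lambda_1^2 + 1/\lambda_2^2},$$
$$\frac{1}{\lambda_1^2} + \frac{1}{\lambda_2^2} \le 2 : P \ge 1 - \frac14\left(\frac{1}{\lambda_1^2} + \frac{1}{\lambda_2^2}\right).$$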










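Returning to the case of an unspecified value of n, we shall often be interested in a limit for the frequency of all values of the standardized or original variables for which the distance from the origin is no more than a certain value, i.e. either
$$\frac{x_1^2}{\sigma_1^2} + \ldots + \frac{x_n^2}{\sigma_n^2} \le \mu^2,$$
or
$$x_1^2 + \ldots + x_n^2 \le \rho^2.$$
In the first problem, we have
$$\lambda_1 = \ldots = \lambda_n = \lambda_0 = \frac{\mu}{\sqrt n},$$
and our inequalities can therefore be written
$$\mu^2 \le 2^{2/n}(n+2)^{(n-2)/n} : P \ge \frac{\mu^n}{(n+2)^{\frac12 n}},$$
$$\mu^2 \ge 2^{2/n}(n+2)^{(n-2)/n} : P \ge 1 - \left(\frac{2}{n+2}\right)^{2/n}\frac{n}{\mu^2}.$$
In the second problem
$$\lambda_1\sigma_1 = \ldots = \lambda_n\sigma_n = \frac{\rho}{\sqrt n}, \qquad \lambda_0 = \frac{\rho}{\sqrt{(\sigma_1^2 + \ldots + \sigma_n^2)}}.$$
Hence, if
$$\rho^2 \le 2^{2/n}(n+2)^{(n-2)/n}\,\frac{\sigma_1^2 + \ldots + \sigma_n^2}{n} : P \ge \left[\frac{n\rho^2}{(n+2)(\sigma_1^2 + \ldots + \sigma_n^2)}\right]^{\frac12 n},$$
$$\rho^2 \ge 2^{2/n}(n+2)^{(n-2)/n}\,\frac{\sigma_1^2 + \ldots + \sigma_n^2}{n} : P \ge 1 - \left(\frac{2}{n+2}\right)^{2/n}\frac{\sigma_1^2 + \ldots + \sigma_n^2}{\rho^2}.$$
Again, the insertion of special values for n will simplify the formulae.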

Finally, it may be at least of theoretical interest to consider the generalized Gauss limit, as obtained for $\kappa = \infty$, as a function of n for fixed values of $\lambda_0$, and to compare it with the generalized Tchebycheff limit which is obtained for $\kappa < 1$ and is independent of n. For some particular values of $\lambda_0$ and n, this is done in the following table:





              |        λ_0
      n       |    1    |    2    |    3
  ------------+---------+---------+---------
              |       Gauss limit
      1       |  0.577  |  0.889  |  0.951
      2       |  0.500  |  0.875  |  0.944
      3       |  0.457  |  0.864  |  0.940
      4       |  0.423  |  0.856  |  0.936
  ------------+---------+---------+---------
              |    Tchebycheff limit
              |  0      |  0.750  |  0.889





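Furthermore, since for $\lambda_0 \le \left(\dfrac{2}{n+2}\right)^{1/n}\left(\dfrac{n+2}{n}\right)^{\frac12}$:
$$\lim_{n\to\infty}\left(\frac{n}{n+2}\right)^{\frac12 n}\lambda_0^n \le \lim_{n\to\infty}\frac{2}{n+2} = 0,$$
and since
$$\lim_{n\to\infty}\left[1 - \left(\frac{2}{n+2}\right)^{2/n}\frac{1}{\lambda_0^2}\right] = 1 - \frac{1}{\lambda_0^2},$$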


it is seen that with an increasing number of variates, the Gauss limit loses gradu- 
ally its superiority over the Tchebycheff limit and the difference between the 
two limits tends to vanish. 
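The tabulated limits can be recomputed directly. The sketch below is an illustration rather than part of the paper: it assumes the two branches of the generalized Gauss limit for $\kappa = \infty$ read as $(n/(n+2))^{n/2}\lambda_0^n$ below the threshold $\lambda_0 = (2/(n+2))^{1/n}((n+2)/n)^{1/2}$ and $1 - (2/(n+2))^{2/n}/\lambda_0^2$ above it, and the function names are hypothetical.

```python
import math


def gauss_limit(n: int, lam0: float) -> float:
    """Generalized Gauss lower limit for P (kappa = infinity), two branches."""
    # Value of lambda_0 at which the two branches meet.
    threshold = (2.0 / (n + 2)) ** (1.0 / n) * math.sqrt((n + 2) / n)
    if lam0 <= threshold:
        return (n / (n + 2)) ** (n / 2) * lam0 ** n
    return 1.0 - (2.0 / (n + 2)) ** (2.0 / n) / lam0 ** 2


def tchebycheff_limit(lam0: float) -> float:
    """Generalized Tchebycheff lower limit, independent of n."""
    return max(0.0, 1.0 - 1.0 / lam0 ** 2)


def limits_table(ns=(1, 2, 3, 4), lams=(1, 2, 3)):
    """Gauss limits rounded to three decimals, keyed by n."""
    return {n: [round(gauss_limit(n, lam), 3) for lam in lams] for n in ns}


if __name__ == "__main__":
    for n, row in limits_table().items():
        print(n, row)
    print("Tchebycheff", [round(tchebycheff_limit(lam), 3) for lam in (1, 2, 3)])
    # For a large number of variates the two limits nearly coincide.
    print(gauss_limit(2000, 2.0), tchebycheff_limit(2.0))
```

Evaluating `gauss_limit` for a large n shows the gradual loss of superiority described in the text.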


REFERENCES 


BERGE, P. O. (1937). A note on a form of Tchebycheff's theorem for two variables. Biometrika, 29, 405-6.

CAMP, B. H. (1922). A new generalization of Tchebycheff's statistical inequality. Bull. Amer. Math. Soc. 28, 427-32.

GAUSS (1823). Theoria combinationis observationum erroribus minimis obnoxiae. Göttingen, pp. 9-12.

NARUMI, S. (1923). On further inequalities with possible applications to problems in the theory of probability. Biometrika, 15, 245-53.

PEARSON, K. (1919). On generalized Tchebycheff theorems in the mathematical theory of statistics. Biometrika, 12, 284-96.

SMITH, C. D. (1930). On generalized Tchebycheff inequalities in mathematical statistics. Amer. J. Math. 52, 109-26.

TCHEBYCHEFF (1867). Des valeurs moyennes. J. Math. pures appl. ser. 2, 12, 177-84.











THE MODE AND MEDIAN OF A NEARLY NORMAL 
DISTRIBUTION WITH GIVEN CUMULANTS 


By J. B. S. HALDANE, F.R.S.


In the early years of biometry the mode and median of a distribution, and 
especially the latter, were regarded as being almost as important as the mean. 
Later Pearson and others developed the method of moments, and recently the 
cumulants, which are readily derived from the moments, have been widely used. 
Pearson (1895 and after) discussed the relation of the mean, mode and median in some of the skew frequency curves which he invented. He found empirically that for skew curves of Type III, namely,
$$y = y_0\left(1 + \frac{x}{a}\right)^p e^{-px/a},$$
when p is positive, Mode − Mean = 3 (Median − Mean) approximately. By fitting for a series of integral values of p he found
$$\frac{\text{Median} - \text{Mode}}{\text{Mean} - \text{Mode}} = 0.6691 + 0.0094p^{-1}.$$

Some later writers have taken the trisection as a general law. But as we shall see, it does not always hold, even approximately. So far as I know, general expressions for the mode and median in terms of the cumulants have not been given, nor have the conditions been stated under which Pearson's rule holds approximately.


Consider a variate X, with distribution $df = F(X)\,dX$, and cumulants $\kappa_1 = m$, $\kappa_2 = \sigma^2$, $\kappa_3, \ldots, \kappa_r, \ldots$

The algebra is simplified if we make the transformation $x = (X - m)/\sigma$, so that x has mean zero, and unit standard deviation, its cumulants being 0, 1, $\gamma_1$, $\gamma_2, \ldots, \gamma_r, \ldots$, where $\gamma_{r-2} = \kappa_r\kappa_2^{-\frac12 r}$. Thus $\gamma_r$ is the rth measure of the deviation from normality of the distribution of X. Now $\gamma_r$ may become infinite for all values of r, or for all even values of r, above a certain value. It may diminish indefinitely or remain small. In other cases it falls with r at first, but then increases without limit. Thus for Pearson's Type III distribution, whose equation has been given above,
$$\kappa_r = (r-1)!\,(p+1)\left(\frac{a}{p}\right)^r, \quad\text{and}\quad \gamma_r = (r+1)!\,(p+1)^{-\frac12 r}.$$

And in the case of any estimate of a statistic, such as the mean, variance, standard deviation, or skewness, based on a sample of n members, $\gamma_r = O(n^{-\frac12 r})$. We shall not discuss the convergence of the series obtained later. But it is worth noting that expansions in Hermitian polynomials are often satisfactory asymptotic expansions, even if they diverge.


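Let $df = f(x)\,dx$ be the distribution of x. Then it is known that we may write, symbolically,*
$$f(x) = \exp\left[-\frac{\gamma_1}{3!}\left(\frac{d}{dx}\right)^3 + \frac{\gamma_2}{4!}\left(\frac{d}{dx}\right)^4 - \frac{\gamma_3}{5!}\left(\frac{d}{dx}\right)^5 + \ldots\right]\frac{e^{-\frac12 x^2}}{\sqrt{(2\pi)}}.$$
Expanding in Hermitian polynomials $H_r(x) = e^{\frac12 x^2}\dfrac{d^r}{dx^r}e^{-\frac12 x^2}$ we have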





e-? 


fie) = Te E — 2 (x) +72 Hy) —73 Hea) 
: 
¢ Et Fem ny 2° Ys H(z) + =) 


In the special case where y, = O(n-?’) we have 


e-* ; : 
fle) = Tar | 1-78 Hale) + 3 a) — Hela) + Fh Hole) 


The terms needed to find the root of this equation with the smallest absolute value depend, of course, on the behaviour of the cumulants as r increases. If $|\gamma_2|$, $|\gamma_3|$, etc., are not substantially less than $|\gamma_1|$, but all are small, so that powers and products can be neglected, then:


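Where $\gamma_r = O(n^{-\frac12 r})$, we have
$$x\left(1 + \frac{5\gamma_2}{8} - \frac{35\gamma_1^2}{24}\right) + \frac{\gamma_1}{2} - \frac{\gamma_3}{8} + \frac{35\gamma_1\gamma_2}{48} - \frac{35\gamma_1^3}{48} - \gamma_1 x^2 + O(n^{-2}) = 0.$$
Hence
$$x = -\frac{\gamma_1}{2} + \frac{\gamma_3}{8} - \frac{5\gamma_1\gamma_2}{12} + \frac{\gamma_1^3}{4} + O(n^{-2}).$$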

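So in the first case the mode is
$$m + \sigma\left(-\frac{\gamma_1}{2} + \frac{\gamma_3}{8} - \frac{\gamma_5}{48} + \ldots + \frac{(-1)^r\gamma_{2r-1}}{2^r\,r!} + \ldots\right) = \kappa_1 - \frac{\kappa_3}{2\kappa_2} + \frac{\kappa_5}{8\kappa_2^2} - \ldots + \frac{(-1)^r\kappa_{2r+1}}{2^r\,r!\,\kappa_2^r} + \ldots \qquad (1)$$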


* For a comparison of this expansion with that of the Charlier Type A distribution, see Cramér 
(1937). The symbolic relation has also been used recently by Cornish & Fisher (1937). 








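If $\kappa_r = O(n)$, then the mode is
$$m - \tfrac12\sigma\left(\gamma_1 - \frac{\gamma_3}{4} + \frac{5\gamma_1\gamma_2}{6} - \frac{\gamma_1^3}{2}\right) + O(n^{-2})$$
$$= \kappa_1 - \frac{\kappa_3}{2\kappa_2} + \frac{\kappa_5}{8\kappa_2^2} - \frac{5\kappa_3\kappa_4}{12\kappa_2^3} + \frac{\kappa_3^3}{4\kappa_2^4} + O(n^{-2})$$
$$= \mu_1' - \frac{\mu_3}{2\mu_2} + \frac{\mu_5}{8\mu_2^2} - \frac{5\mu_3\mu_4}{12\mu_2^3} + \frac{\mu_3^3}{4\mu_2^4} + O(n^{-2}). \qquad (2)$$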


The median is the value of x for which $\int_{-\infty}^{x} f(u)\,du = \tfrac12$, or, since


the value for which 


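$$\frac{1}{\sqrt{(2\pi)}}\int_0^x e^{-\frac12 u^2}\,du + \frac{e^{-\frac12 x^2}}{\sqrt{(2\pi)}}\left[-\frac{\gamma_1}{6}H_2(x) + \frac{\gamma_2}{24}H_3(x) - \ldots\right] = 0.$$
So
$$e^{\frac12 x^2}\int_0^x e^{-\frac12 u^2}\,du - \frac{\gamma_1}{6}H_2(x) + \frac{\gamma_2}{24}H_3(x) - \frac{\gamma_3}{120}H_4(x) + \ldots = 0.$$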
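If $\gamma_r$ does not fall off systematically we have
$$x = -\frac{\gamma_1}{6} + \frac{\gamma_3}{40} - \frac{\gamma_5}{336} + \ldots + \frac{(-1)^r\gamma_{2r-1}}{2^r(2r+1)\,r!} + \ldots$$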

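When $\kappa_r = O(n)$, we find
$$\frac{x^3}{3} - \frac{\gamma_1 x^2}{6} + \left(1 + \frac{\gamma_2}{8} - \frac{5\gamma_1^2}{24}\right)x + \frac{\gamma_1}{6} - \frac{\gamma_3}{40} + \frac{5\gamma_1\gamma_2}{48} - \frac{35\gamma_1^3}{432} + O(n^{-2}) = 0.$$
Hence
$$x = -\frac{\gamma_1}{6} + \frac{\gamma_3}{40} - \frac{\gamma_1\gamma_2}{12} + \frac{17\gamma_1^3}{324} + O(n^{-2}).$$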


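So if powers and products can be neglected the median is
$$m + \sigma\left(-\frac{\gamma_1}{6} + \frac{\gamma_3}{40} - \frac{\gamma_5}{336} + \ldots\right) = \kappa_1 - \frac{\kappa_3}{6\kappa_2} + \frac{\kappa_5}{40\kappa_2^2} - \frac{\kappa_7}{336\kappa_2^3} + \ldots + \frac{(-1)^r\kappa_{2r+1}}{2^r(2r+1)\,r!\,\kappa_2^r} + \ldots \qquad (3)$$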

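And when $\kappa_r = O(n)$, the median is
$$m - \tfrac12\sigma\left(\frac{\gamma_1}{3} - \frac{\gamma_3}{20} + \frac{\gamma_1\gamma_2}{6} - \frac{17\gamma_1^3}{162}\right) + O(n^{-2})$$
$$= \kappa_1 - \frac{\kappa_3}{6\kappa_2} + \frac{\kappa_5}{40\kappa_2^2} - \frac{\kappa_3\kappa_4}{12\kappa_2^3} + \frac{17\kappa_3^3}{324\kappa_2^4} + O(n^{-2})$$
$$= \mu_1' - \frac{\mu_3}{6\mu_2} + \frac{\mu_5}{40\mu_2^2} - \frac{\mu_3\mu_4}{12\mu_2^3} + \frac{17\mu_3^3}{324\mu_2^4} + O(n^{-2}). \qquad (4)$$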




Thus the distance from the mean to the mode is approximately thrice that from the mean to the median if $\gamma_2$, $\gamma_4$, etc., are small quantities (i.e. $\beta_2$ is nearly 3, etc.) and if $\gamma_3$, $\gamma_5$, etc., are small compared with $\gamma_1$. This is often the case for nearly normal distributions. Thus for the best known of Type III distributions, that of $\chi^2$ for n degrees of freedom, $\kappa_1 = n$, $\kappa_2 = 2n$, $\kappa_r = 2^{r-1}(r-1)!\,n$. Hence from equation (2) the mode is at $n - 2 + O(n^{-2})$. In fact it is $n - 2$ exactly. For the median, equation (4) gives $n - \dfrac23 + \dfrac{32}{405n} + O(n^{-2})$. The fraction which Pearson empirically estimated at $0.6691 + 0.0094p^{-1}$ is therefore $\dfrac23 + \dfrac{8}{405p} + O(p^{-2})$.
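The $\chi^2$ check can be reproduced numerically. The sketch below is an illustration, not the author's computation: it assumes the cumulant forms of equations (2) and (4) with the coefficients 1/2, 1/8, 5/12, 1/4 and 1/6, 1/40, 1/12, 17/324, and the function names are hypothetical.

```python
import math


def mode_from_cumulants(k1, k2, k3, k4, k5):
    """Approximate mode from the first five cumulants (assumed form of equation (2))."""
    return (k1 - k3 / (2 * k2) + k5 / (8 * k2 ** 2)
            - 5 * k3 * k4 / (12 * k2 ** 3) + k3 ** 3 / (4 * k2 ** 4))


def median_from_cumulants(k1, k2, k3, k4, k5):
    """Approximate median from the first five cumulants (assumed form of equation (4))."""
    return (k1 - k3 / (6 * k2) + k5 / (40 * k2 ** 2)
            - k3 * k4 / (12 * k2 ** 3) + 17 * k3 ** 3 / (324 * k2 ** 4))


def chi_square_cumulants(n, up_to=5):
    """Cumulants kappa_r = 2^(r-1) (r-1)! n of chi-square with n degrees of freedom."""
    return [2 ** (r - 1) * math.factorial(r - 1) * n for r in range(1, up_to + 1)]


if __name__ == "__main__":
    n = 10
    ks = chi_square_cumulants(n)
    print(mode_from_cumulants(*ks))      # close to n - 2
    print(median_from_cumulants(*ks))    # close to n - 2/3 + 32/(405 n)
```

For the $\chi^2$ cumulants the three terms of order $1/n$ in the mode cancel, which is how the approximation recovers the exact value $n - 2$.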


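Consider the distribution of the estimated mean, $n^{-1}\Sigma x_i$, from a sample of n, taken from a distribution with arbitrary cumulants $\kappa_1$, $\kappa_2$, $\kappa_3$, etc. The cumulants of the distribution of the mean are $\kappa_1$, $\kappa_2/n$, $\kappa_3/n^2$, etc. Hence equations (2) and (4) hold, and the mean is $\kappa_1$, the mode is
$$\kappa_1 - \frac{\kappa_3}{2n\kappa_2} + \left(\frac{\kappa_5}{8\kappa_2^2} - \frac{5\kappa_3\kappa_4}{12\kappa_2^3} + \frac{\kappa_3^3}{4\kappa_2^4}\right)n^{-2} + O(n^{-3}),$$
and the median is
$$\kappa_1 - \frac{\kappa_3}{6n\kappa_2} + \left(\frac{\kappa_5}{40\kappa_2^2} - \frac{\kappa_3\kappa_4}{12\kappa_2^3} + \frac{17\kappa_3^3}{324\kappa_2^4}\right)n^{-2} + O(n^{-3}).$$

The corresponding expressions in terms of the moments may be written down. The skewness is not exactly 1/n of the skewness of the original curve, but nearly so if n is large.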
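As a worked instance of these expansions, consider samples from the exponential distribution with unit mean, for which $\kappa_r = (r-1)!$. This example is not in the paper; it is a sketch under the assumption that the mode and median of the sample mean are given by the $n^{-1}$ and $n^{-2}$ terms as written above. The mean of n such observations follows a gamma law whose mode is exactly $1 - 1/n$, which gives an independent check. All names are hypothetical.

```python
import math


def mean_mode(k, n):
    """Mode of the sample mean; k[r] is the population cumulant kappa_r (k[0] unused)."""
    second_order = (k[5] / (8 * k[2] ** 2) - 5 * k[3] * k[4] / (12 * k[2] ** 3)
                    + k[3] ** 3 / (4 * k[2] ** 4))
    return k[1] - k[3] / (2 * n * k[2]) + second_order / n ** 2


def mean_median(k, n):
    """Median of the sample mean, to the same order in 1/n."""
    second_order = (k[5] / (40 * k[2] ** 2) - k[3] * k[4] / (12 * k[2] ** 3)
                    + 17 * k[3] ** 3 / (324 * k[2] ** 4))
    return k[1] - k[3] / (6 * n * k[2]) + second_order / n ** 2


# Cumulants of the unit exponential distribution: kappa_r = (r - 1)!.
EXPONENTIAL = [None] + [float(math.factorial(r - 1)) for r in range(1, 6)]

if __name__ == "__main__":
    n = 25
    print(mean_mode(EXPONENTIAL, n), 1 - 1 / n)
    print(mean_median(EXPONENTIAL, n), 1 - 1 / (3 * n) + 8 / (405 * n ** 2))
```

Here the $n^{-2}$ bracket of the mode is $3 - 5 + 2 = 0$, so the approximation agrees with the exact gamma mode.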

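Again consider the distribution of the estimated variance from a sample of n members from a normal distribution with unknown mean, and variance $\sigma^2$. The cumulants of the distribution of the estimated variance $\dfrac{1}{n-1}\left[\Sigma x_i^2 - n^{-1}(\Sigma x_i)^2\right]$ are
$$\kappa_1 = \sigma^2, \quad \kappa_2 = \frac{2\sigma^4}{n-1}, \quad \kappa_3 = \frac{8\sigma^6}{(n-1)^2}, \quad \kappa_4 = \frac{48\sigma^8}{(n-1)^3}, \quad \ldots, \quad \kappa_r = \frac{2^{r-1}(r-1)!\,\sigma^{2r}}{(n-1)^{r-1}}.$$

Thus we are dealing with a slightly transformed $\chi^2$ distribution, and the mode is $\sigma^2 - \dfrac{2\sigma^2}{n-1}$, or $\dfrac{(n-3)\sigma^2}{n-1}$, exactly. From equation (4) the median is
$$\sigma^2\left[1 - \frac{2}{3(n-1)} + \frac{32}{405(n-1)^2}\right] + O(n^{-3}) = \sigma^2\left[1 - \frac{2}{3n} - \frac{238}{405n^2}\right] + O(n^{-3}).$$

If, however, the distribution sampled is not normal, but has a finite $\kappa_4$, etc., then the first three cumulants, in terms of those of the original distribution, are
$$\kappa_2, \qquad \frac{\kappa_4}{n} + \frac{2\kappa_2^2}{n-1}, \qquad \frac{8\kappa_2^3}{(n-1)^2} + \frac{4(n-2)\kappa_3^2}{n(n-1)^2} + \frac{12\kappa_2\kappa_4}{n(n-1)} + \frac{\kappa_6}{n^2}.$$
Fisher (1928) gives expressions for the next three cumulants, but these become very complex, $\kappa(2^6)$ having 21 terms.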













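It follows that the mean is $\kappa_2$, the mode
$$\kappa_2 - \left[2\kappa_2 + \frac{4\kappa_3^2 + 8\kappa_2\kappa_4 + \kappa_6}{2(2\kappa_2^2 + \kappa_4)}\right]n^{-1} + O(n^{-2}),$$
and the median
$$\kappa_2 - \left[\frac{2\kappa_2}{3} + \frac{4\kappa_3^2 + 8\kappa_2\kappa_4 + \kappa_6}{6(2\kappa_2^2 + \kappa_4)}\right]n^{-1} + O(n^{-2}).$$
The distribution of the estimate of $\kappa_3$ is symmetrical for a symmetrical distribution. In general its mean is $\kappa_3$, its mode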


NG tf POs Wing tN + 


| 45x Borwesae sens n-1+0(n-2), 
te [ re 2( 6x3 + 9KZ + Ky Ky + Ke) | ll 


the median differing from the mean by $\tfrac13$ of the value given above.
Finally the mean estimate of $\kappa_4$ for a sample from a normal distribution is zero, its mode $-\dfrac{36\sigma^4}{n} + O(n^{-2})$, and its median $-\dfrac{12\sigma^4}{n}$. The general expressions for the mode and median can readily be calculated from Fisher's (1928) equations.

We now pass to some cases where Pearson's trisection rule does not hold, even approximately.

Pearson's Type I and Type IV curves are asymmetrical, and have one more adjustable parameter than Type III. Thus $\gamma_2$, i.e. $\beta_2 - 3$, can vary independently of $\gamma_1$, i.e. $\sqrt{\beta_1}$. In consequence the curves may be nearly symmetrical, but far from normal. This is so if they approximate to Type II or Type VII, respectively. In this case the even measures of divergence $\gamma_{2r}$ may be much larger than the odd measures $\gamma_{2r+1}$, which tend to zero with symmetry. Thus we cannot neglect higher cumulants, or products, as in equations 1-4. For Type IV and VII curves the higher moments are infinite, so no formulae of the given type are possible. For Types I and II a formula could be given, but direct calculation is clearly preferable.


A simpler case is that of the scalene triangular distribution whose graph is obtained by joining $(-b, 0)$, $\left(0, \dfrac{2}{a+b}\right)$, and $(a, 0)$. That is to say
$$y = \frac{2(b+x)}{b(a+b)} \text{ for } -b \le x \le 0, \quad\text{and}\quad y = \frac{2(a-x)}{a(a+b)} \text{ for } 0 \le x \le a.$$
Here the mean is $\tfrac13(a-b)$, the mode 0, and the median, if $a > b$, is
$$a - \sqrt{[\tfrac12 a(a+b)]}, \quad\text{or}\quad \frac{a-b}{4} + \frac{(a-b)^2}{32a} + \frac{(a-b)^3}{128a^2} + \ldots$$
That is to say, the distance from the mean to the median is one-quarter, not one-third, of the distance from the mean to the mode, if $a - b$ is small compared with a. The rth moment about zero is
$$\frac{2[a^{r+1} - (-b)^{r+1}]}{(r+1)(r+2)(a+b)}.$$
If $k = \dfrac{a-b}{\sqrt{(ab)}}$, then
$$\gamma_1 = \frac{\sqrt2\,k(9+2k^2)}{5(3+k^2)^{3/2}}, \quad \gamma_2 = -\frac35, \quad \gamma_3 = -\frac{6\sqrt2\,k(9+2k^2)}{7(3+k^2)^{3/2}}, \text{ etc.}$$
If k is small, the odd $\gamma$'s are of order k, whilst the even ones approximate to those of the symmetrical distribution. Moreover $\gamma_3 = -\tfrac{30}{7}\gamma_1$, so the formulae 1-4 clearly do not hold.
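The triangular example is easy to verify by direct calculation. The following sketch is not from the paper; it assumes the density given above, with vertices at $-b$, 0 and a, and checks that the closed-form median has cumulative probability one-half, that the series approximates it, and that the mean-to-median distance is near one-quarter of the mean-to-mode distance when $a - b$ is small. Function names are hypothetical.

```python
import math


def triangular_cdf(x, a, b):
    """Cumulative distribution of the scalene triangle on (-b, a) with mode 0."""
    if x <= -b:
        return 0.0
    if x <= 0.0:
        return (b + x) ** 2 / (b * (a + b))
    if x < a:
        return 1.0 - (a - x) ** 2 / (a * (a + b))
    return 1.0


def triangular_median(a, b):
    """Closed-form median, valid for a > b."""
    return a - math.sqrt(0.5 * a * (a + b))


def triangular_median_series(a, b):
    """First three terms of the expansion in powers of (a - b)."""
    d = a - b
    return d / 4 + d ** 2 / (32 * a) + d ** 3 / (128 * a ** 2)


if __name__ == "__main__":
    a, b = 1.0, 0.8
    mean, mode, median = (a - b) / 3, 0.0, triangular_median(a, b)
    print(triangular_cdf(median, a, b))                 # one-half
    print(median, triangular_median_series(a, b))
    print((mean - median) / (mean - mode))              # near 1/4, not 1/3
```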


We see then that formulae (2) and (4) are quite useful in a special case which is 
important in sampling theory, but have no general validity. 


SUMMARY 


Expressions are obtained by which the distances of the mode and median from the mean of a skew distribution can be calculated from its moments or cumulants. The series obtained may or may not converge. They give satisfactory results for Type III distributions, and for the distributions of the mean, variance, and other cumulants as estimated from samples.


REFERENCES 


CORNISH, E. A. & FISHER, R. A. (1937). Moments and cumulants in the specification of distributions. Rev. Inst. Int. Statist. Pt. 4.

CRAMÉR, H. (1937). Random Variables and Probability Distributions, Cambridge Mathematical Tracts, 86-88.

FISHER, R. A. (1928). Moments and product moments of sampling distributions. Proc. Lond. Math. Soc. Ser. 2, 30, 199.

PEARSON, K. (1895). Contributions to the mathematical theory of evolution. II. Skew variation in homogeneous material. Philos. Trans. 186, 343.













TABLE OF PERCENTAGE POINTS OF THE 


t-DISTRIBUTION 


COMPUTED BY MAXINE MERRINGTON


THE following table has been derived from Miss Thompson's Tables of Percentage Points of the Incomplete Beta Function (Biometrika, 32, 168-81), by taking

$$t = \sqrt{\{\nu_2(1-x)/\nu_1 x\}}$$

for the case $\nu_1 = 1$. t is the usual "Student" ratio based on an estimate of variance having $\nu = \nu_2$ degrees of freedom. If $t_P$ is the quantity tabled, P/100 is the chance that $|t| > t_P$, i.e. represents the area in the two tails of the t-distribution. The table includes certain levels for t not previously available and should be accurate to the five significant figures shown.
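A few entries of such a table can be reproduced without beta-function tables. The sketch below is an illustration and not the method used for the published table: it assumes the transformation reads $t = \sqrt{\nu_2(1-x)/(\nu_1 x)}$, and it relies on the standard closed forms of the t-distribution for $\nu = 1$ and $\nu = 2$ and on the normal limit for $\nu = \infty$, none of which is stated in the text. Function names are hypothetical.

```python
import math
from statistics import NormalDist


def t_from_beta_point(x: float, nu1: float, nu2: float) -> float:
    """Transformation from a beta percentage point x to t (assumed form)."""
    return math.sqrt(nu2 * (1.0 - x) / (nu1 * x))


def beta_point_from_t(t: float, nu1: float, nu2: float) -> float:
    """Inverse of the transformation above."""
    return nu2 / (nu2 + nu1 * t * t)


def t_two_tail_nu1(p: float) -> float:
    """Two-tail point for nu = 1 (Cauchy law): P(|t| > t_P) = p."""
    return math.tan(math.pi * (1.0 - p) / 2.0)


def t_two_tail_nu2(p: float) -> float:
    """Two-tail point for nu = 2, from the closed-form distribution function."""
    q = 1.0 - p
    return math.sqrt(2.0 * q * q / (1.0 - q * q))


def t_two_tail_normal(p: float) -> float:
    """Two-tail point for nu = infinity (normal deviate)."""
    return NormalDist().inv_cdf(1.0 - p / 2.0)


if __name__ == "__main__":
    for p in (0.50, 0.25, 0.10, 0.05, 0.01):
        print(p, t_two_tail_nu1(p), t_two_tail_nu2(p), t_two_tail_normal(p))
    # Round trip through the beta variable for nu = 2, P = 50.
    x = beta_point_from_t(t_two_tail_nu2(0.5), 1, 2)
    print(x, t_from_beta_point(x, 1, 2))
```

The printed values can be compared with the first, second and last rows of the table, for example 1.00000 and 0.81650 in the P = 50 column.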


\ 





P 


50 


1-00000 
0-81650 
0-76489 
0-74070 


0-72669 
0-71756 
0-71114 
0-70639 
0-70272 


0-69981 
0-69745 
0-69548 
0-69384 
0-69242 


0-69120 
0-69013 
0-68919 
0-68837 
0-68763 


0-68696 
0-68635 
0-68580 
0-68531 
0-68485 


0-68443 
0-68405 
0-68370 
| 068335 
| 068304 
| 





0-68276 
0-68066 
0-67862 
0-67656 
0-67449 











2-4142 
1-6036 


1866 


1-1848 
11831 
1-1816 
1-1802 
1-1789 


1-1777 
1-1766 
1-1757 
1-1748 
1-1739 


1-173] 
1-1673 
1-1616 





6-°3138 
2-9200 
2-3534 
2-1318 


2-0150 
1-9432 
1-8946 
1-8595 


1-8331 


“8125 
-7959 


*7396 
*7341 
‘7291 


*7247 
*7207 
‘7171 
-7139 
‘7109 


—— ss | 


‘7081 
-7056 
-7033 
‘7011 
‘6991 


“6577 
6449 


—s se 
S 





| 
| 


12-706 
43027 
3-1825 
2-7764 


bo bo bo by to 
~ 
oS 
x 


2-0860 
2-0796 
2-0739 
2-0687 
2-0639 


2-0595 
2-0555 
2-0518 
2-0484 
2-0452 


2-0423 
2-0211 
2-0003 
1-9799 
1-9600 











2-5 1 
9452 | 63-657 
6:2053 | 99248 
4:1765 | 5-8409 
3-4954 4-6041 

| 4-032] 


3-7074 
3°4995 
3°3554 
3°2498 


3°1693 


2-5096 


| 
2-4899 | 2-9467 
2-4729 | 2-9208 
2-4581 2-8982 
2-4450 2-8784 
2-4334 2-8609 
2-4231 2-8453 
2-4138 2-8314 
2-4055 2-8188 
2-3979 2-8073 
2-3910 2-7969 
2-3846 2:7874 
2-3788 2-7787 
2-3734 2-7707 
2-3685 2-7633 
2:3638 2-7564 
2-3596 2-7500 
2-3289 2-7045 
2-299) 2-6603 
2-2699 2-6174 
2-2414 2-5758 


0-5 


127-32 
14-089 
7:4533 


5-5976 


4:7733 
4:3168 
4-0293 
| 38325 
| 3-6897 | 


35814 
3-4966 
3:4284 
3°3725 
3°3257 


| 3.2860 
| 3-2520 
32225 
31966 
3-1737 


3-0905 


| 30782 
3-0669 
3°0565 
3-0469 
3-0380 


| 3.0298 
| 92-9712 
2-9146 
2-8599 
2-8070 





THE PROBABILITY INTEGRAL OF THE RANGE IN SAMPLES 
OF n OBSERVATIONS FROM A NORMAL POPULATION 


I. FOREWORD AND TABLES 
By E. S. PEARSON
1. Scope of the main table 


Denote by $x_1, x_2, \ldots, x_n$ a random sample of n observations, arranged in ascending order of magnitude, drawn from a normal or Gaussian population having for probability law
$$p(x) = \frac{1}{\sigma\sqrt{(2\pi)}}\exp\left[-\tfrac12\frac{(x-\mu)^2}{\sigma^2}\right], \qquad (1)$$
where $\mu$ and $\sigma$ are respectively the mean and standard deviation of the population. The range, sometimes described as the spread, in the sample is $x_n - x_1$ and we shall write the ratio of the range to the population standard deviation as
$$w = \frac{x_n - x_1}{\sigma}. \qquad (2)$$
No simple expression exists for the probability law $f_n(w)$ of w, but Table 1 below gives for specific values W of w computed values of the probability integral
$$P_n(W) = \int_0^W f_n(w)\,dw. \qquad (3)$$
This expression is the chance that the range in a sample of n observations is less than a given multiple of the population standard deviation. The table has been calculated for samples with n lying between 2 and 20 and for intervals of 0.05 of W. The values of the integral are given to 4-decimal place accuracy; linear interpolation is adequate except in the neighbourhood of the two quartiles in each column. The method of calculation is described below by Dr H. O. Hartley in a separate section.
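Individual entries of Table 1 can be checked by numerical quadrature. The sketch below does not follow the method of calculation described in the paper; it assumes the standard single-integral expression $P_n(W) = n\int_{-\infty}^{\infty}\varphi(x)\,[\Phi(x+W) - \Phi(x)]^{n-1}\,dx$ for the range of n independent normal observations and evaluates it by Simpson's rule. The function names, the integration limits of ±8 and the number of steps are all choices made for this illustration.

```python
import math


def normal_pdf(x: float) -> float:
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def normal_cdf(x: float) -> float:
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def range_probability_integral(n: int, w: float, lo=-8.0, hi=8.0, steps=2000) -> float:
    """Chance that the range of n normal observations is below w standard deviations.

    Simpson's rule on the assumed integral formula; steps must be even.
    """
    if w <= 0.0:
        return 0.0
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        x = lo + i * h
        weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
        total += weight * normal_pdf(x) * (normal_cdf(x + w) - normal_cdf(x)) ** (n - 1)
    return n * total * h / 3.0


if __name__ == "__main__":
    for n, w in ((2, 1.0), (2, 2.0), (3, 1.0)):
        print(n, w, round(range_probability_integral(n, w), 4))
```

For n = 2 the integral reduces to erf(W/2), which gives a check that is independent of the table.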


2. Auxiliary table for special uses (Table 2)


When dealing with samples containing only a small number of observations the range or spread may often be usefully employed as a measure of dispersion in place either of the standard (root mean square) deviation or the mean deviation. For example, an estimate of the population standard deviation may be obtained by multiplying the range in a single sample or the mean range in a number of samples by the factors $a_n$ shown in Table 2. The accuracy of this form of estimation of $\sigma$ compared with that of other methods has been discussed by Davies & Pearson (1934).

In other circumstances it may be useful to plot in serial order on a control chart the values of range obtained from successive samples, e.g. when dealing with the control of quality in mass production. For this purpose it is necessary to know certain standard probability levels for w which will serve as control limits*. Twelve of these levels expressed as percentage points and obtained by interpolation in the main Table 1, are shown in Table 2. They replace approximate limits published a few years ago (Pearson, 1932). It will be found, however, that except in the case n = 3†, the discrepancies between the two tables are all small. As an example, the table shows that if samples of 7 observations are randomly drawn from a population with a standard deviation $\sigma$, then in the long run only 5% of these should have a range greater than $4.17\sigma$, while 95% should satisfy the inequality

$$1.25\sigma < x_n - x_1 < 4.49\sigma.$$

Probability levels for samples with n > 12 have not been included as the use of range for control purposes in larger samples is of doubtful value.
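The worked example for samples of 7 can be turned into a simple control check. This is a minimal sketch using only the limits quoted in the text (1.25 and 4.49 as multiples of the standard deviation for n = 7); the function name, the sample values and the assumption that the population standard deviation is known are illustrative.

```python
def range_within_limits(sample, sigma, lower=1.25, upper=4.49):
    """True if the sample range, in units of sigma, lies inside the 95% limits for n = 7."""
    if len(sample) != 7:
        raise ValueError("the quoted limits apply to samples of 7 observations")
    w = (max(sample) - min(sample)) / sigma
    return lower < w < upper


if __name__ == "__main__":
    sigma = 2.0
    in_control = [10.1, 12.4, 9.3, 11.0, 13.6, 10.8, 8.9]     # range 4.7, w = 2.35
    out_of_control = [10.1, 12.4, 3.0, 11.0, 13.6, 10.8, 8.9]  # range 10.6, w = 5.3
    print(range_within_limits(in_control, sigma))
    print(range_within_limits(out_of_control, sigma))
```

Limits for other sample sizes would have to be read from Table 2 itself.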

* For a discussion of the use of range in problems of industrial quality control see Reports issued by the British Standards Institution (Pearson, 1935, pp. 89-90; Dudding & Jennett, 1942).
† Correct values for the case n = 3 were given by McKay & Pearson (1933).











Table 1. Probability integral of the range W in normal samples of size n 















































W \ n | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10
0:00 | 0-0000 | 0-0000 
0-05 | -0282| -0007 | 0-000 
0-10 | -0564| -0028| -0001 
0-15 | -0845 | -0062| -0004 | 0-0000 
0:20 | -1125| -0110| -0010| -0001 
0-25 | 0-1403 | 0-0171 | 0-0020 | 0-0002 
0:30 | -1680| -0245| -0034| -0004 | 0-0000 
0:35 | -1955| -0332| -0053| -0008 | -0001 
0-40 | -2227| -0431| -0079| -0014| -0002 | 00000 
0-45 | -2497| -0543| -0111| -0022| -0004| -0001 
0:50 | 0-2763 | 0-066 | 0-0152 | 0.0033 | 0-0007 | 0-002 | 0-0000 
0-55 | -3027| -0800| -0200| -0048! -0011| -0003| -0001 
0-60 | -3286| -0944| -0257| -0068 | -0017| -0004| -0001 | 0-0000 
0-65 | -3542| -1099| -0323| -0092| -0025| -0007| -0002| -0001 
0-70 | -3794| -1263| -0398| -0121| -0036| -0011| -0003| -0001 | 0-0000 | 
0-75 | 0-4041 | 0-1436 | 0-0483 | 0-0157 | 0-0050 | 0-0016 | 0-0005 | 0-0002 | 0-0001 | 
e-80 | -4284| -1616| -0578| -0200| -0068| -0023| -0008| -0002| -0001 
0-85 | -4522| -1805| -0682| -0250| -0090!| -0032| -0011| -0004| -0001 
0:90 | -4755| -2000| -0797| -0309| -0117| -0044| -0016| -0006| -0002 
0-95 | -4983] -2201| -0922/ -0375| -0150| -0059| -0023| -0009| -0003 
1-00 | 0-5205 | 0-2407 | 0-1057 | 0-0450 | 0-0188 | 0-0078 | 0-0082 | 0-0013 | 0-0005 
1-05 | -5422| -2618| -1201| -0535| -0234| -0101| -0043| -0018| -0008 
1-10 | -5633| -2833| -1355| -0629| -0287| -0129| -0057| -0025| -0011 
1-15 | -5839| -3051| -1517| -0733| -0348| -0163| -0075 | -0035 | -0016 
120 | -6039| -3272| -1688| -0847| -0417| -0203| -0098| -0047| -0022 
1-25 | 0-6232 | 0-3495 | 0-1868 | 0-0970 | 0-0495 | 0-0250 | 0-0125 | 0-0062 | 0-0030 
1-30 | -6420| -3719| -2054| -1104| -0583| -0304| -0157, -0080| -0041 
1-35 | -6602| -3943| -2248| -1247| -0680| -0366| -0195| -0103| -0054 
1-40 | -6778| -4168| -2448| -1400| -0787| -0437| -0240| -0131 | -0071 
1-45 | -6948| -4392| -2654| -1562| -0904| -0517| -0292| -0164| -0092 
1-50 | 0-712 | 0-4614 | 0-2865 | 0-1733 | 0-1031"| 0-0606 | 0-0853 | 0-0204 | 0-0117 
1-55 -7269 | +4835) -3080/ -1913| -1168| -0705| -0422| -0250)| -0148 
1-60 | -7421| -5053 | -3299| -2101| -1316| -0814| -0499| -0304| -0184 
1-65 | -7567| -5269| -3521| -2296| -1473| -0934| -0587| -0366| -0227 
1-70 | -7707| -5481| -3745| -2498| -1639| -1064| -0684| -0437| -0277 
1-75 | 0-7841 | 0-5690 | 0-3971 | 0-2706 | 0-1815 | 0-1204 | 0-0792 | 0-0517 | 0-0336 | 
1-80 | -7969| -5894| -4197| -2920| -2000| -1355| -0910| -0607 | -0403 | 
1-85 | -8092| -6094| -4423| -3138| -2193| -1516| -1039| -0707| -0479 | 
1-90 | -8209| -6290| -4649| -3361| -2304| -1686| -1178| -0818| -0565 | 
1-95 | -8321| -6480| -4874| -3587| -2602| -1867| -1329| -o940| -0661 | 
| 2-00 | 0-8427 | 0-6665 | 0-5096 | 0-3816 | 0-2816 | 0-2056 | 0-1489 | 0-1072 | 0-0768 
2:05 | -8528| -6845| -5317| -4046| -3035| -2254| -1661| -1216| -0886 
2:10 | -8624| -7019| -5534| -4277| -3260| -2460| -1842| -1371| -1015 
215 | -8716| -7187| -5748| -4508| -3489| -2673| -2033| -1536| -1156 
2:20 | -8802| -7349| -5957| -4739| -3720| -2803| -2232| -1712| -1307 
2-25 | 0-8884 | 07505 | 06163 | 0-4969 | 03955 | 03118 | 0-2440 | 0-1899 | 0-1470 
2-30 | -8961| -7655| -6363| -5196| -4190| -3348| -2656| -2095| -1645 
2-35 | -9034| -7799| -6558| -5421| -4427| -3582| -2878| -2300| -1830 
| 2-40 | -9103| -7937| -6748| -5643| -4663| -3820| -3107| -2514| -2025 | 
| 2-45 | -9168| -8069| -6932! -5861| -4899| -4059| -3341| -2735| -2230 | 
| 28 | 0-9229 | 0-8195 | 0-7110 | 0-6075 | 0-5132 | 0-4300 | 03579 | 0-2964 | 0-2443 
| 

















302 





Table 1 (cont.). Probability integral of the range W in normal samples of size n 
































n | 
- 11 12 13 14 15 16 | 47 1s | 49 20 
| 
| 
| | 
| | 
| | | 
| | | | 
|| 
| 
| 
| 
| 
| 
| | 
| 0-85 | 0-0000 | 
0-90 | -0001 | 
0-95 | -0001 | 0-0000 | | 
| 1-00 | 0-0002 | 0-0001 | 0-0000 | | 
| 1-05 | -0003 | -0001 | -0001 | 
| 1-10 | -0005 | -0002 | -0001 | 0-0000 | | 
1-15 | -0007| -0003 | -0001 | -0001 | 
| 1-20 | -0010} -0005 | -0002)| -0001 lyre | 
| 
| 


| 1-25 | 0-0015 | 0-0007 
| 1-30 “0021 -0010 
| 1-35 -0028 0015 
| 1-40 -0038 -0021 


0-0000 | 





-0005 | -0003 | -0001 -0001 | 0-0000 | 
-0008 -0004 -0002 | -0001 | -0001 | 0-0000 


0-:0004 | 0-0002 | 60-0001 
| 
-0001 -0001 


‘0011 -0006 -0003 | -0002 


1-45 | -0051| -0028 | 0016 | -0009| -0005 | -0003 | -0001 | -0001 | 0-0000 | 
1:50 | 0-0067 | 0-0038 | 0-0022 | 00012 | 0-0007 | 0-0004 | 90-0002 | 0-0001 | 0-0001 | 0-0000 | 
1:55 | -0087| -0051 | -0030| -0017| -0010 | 


-0006 
1-60 ‘O111 -0067 -0040 -0024 ‘0014 | -0008 | 
1-65 -0140 0086 -0053 -0032 -0020 
1-70 “0175 -O111 -0069 -0043 -0027 


-0003 | -0002 | -0001 -0001 
-0005 -0003 -0002 -0001 
‘0012 | -0007 | -0004 | -0003 | -0002 
‘0017 | -0011 | -0007 -0004 | -0003 

















1:75 | 0-0217 | 0-0140 | 0-0090 | 0-0058 | 0-0037 | 0-0024 | 0-0015 | 0-0010 | 0-0006 | 0-0004 





























| 2-20 0994 | -0753| -0569| -0429| -0323| -0242| -0181 | -0136| -0102| -0076 


| 1-80 -0266 | -0175 “0115 | -0075 | -0049 | -0032 ‘0021 | -0013 | -0009 | -0006 
1-85 -0323 -0217 “0145 | -0097 | -0065 | 0043 | -0028 | -0019 | -0012] -0008 
1-90 ‘0388 | -0266| -0182| -0124 | -0084 | -0057 | -0039 | -0026 | -0018| -0012 
1-95 “0463 -0323 -0225 | -0156.) -0108 | -0075 | -6052 | | -0035 | -0024 | -0017 

| ] 
2-00 | 0-0548 | 0-0389 | 0-0276 | 0-0195 | 0-0137 | 0-0097 | 0-0068 | 0-0048 | 0-0033 | 0-0023 
2-05 -0643 ‘0465 | -0335 0241 ‘0173 | -0124 | -0088 | -0063 | -0045 | -0032 
| 2-10 0749 | -0550 | -0403 | -0295 | -0215) -0156 | -O114 -0082 -0060 | -0043 
| 2-15 ‘0866 | -0646 | -0481 -0357 -0264 | -0196 | -0144 | “0106 | “0078 | -0057 


| 
25 0-134 | 0-0872 0-0669 | 0-0511 

















| 
0-0390 | 0-0297 |o0 0-0226 
| 
| 
| 


2: | 0-0172 | 0-0130 | 0-0099 
2:30 | -1286| -1003| -0779| -0605| -0468| -0361| -0279| -0215| -0165| -0127 
2-35 | -1450| -1145| -0902 | -0709 | -0556 | “0435 | -0340 | -0265| -0207| -0161 
2-40 | -1625| -1300| -1037| -0825| -0655| -0519| -0411 | 0325 | -0256| -0202 
2-45 | -1811| -1466/ -1183| -0953| -0766| -0615| -0493 0394 | -0315| -0251 
2 


| | 


-50 | 0-2007 | 0-1644 | 0-1342 | 0-1094 astad lenaee: 0-0586 | 0-0474 | 0-0383 | 0-0309 
‘ | — = 


303 

















Table 1 (cont.). Probability integral of the range W in normal samples of size n

 W   | n = 2  |   3    |   4    |   5    |   6    |   7    |   8    |   9    |   10
2.50 | 0.9229 | 0.8195 | 0.7110 | 0.6075 | [illegible] | 0.4300 | 0.3579 | 0.2964 | 0.2443
2.55 | 0.9286 | 0.8315 | 0.7282 | 0.6283 | 0.5364 | 0.4541 | 0.3820 | 0.3198 | 0.2665
2.60 | 0.9340 | 0.8429 | 0.7448 | 0.6487 | 0.5592 | 0.4782 | 0.4064 | 0.3437 | 0.2894
2.65 | 0.9390 | 0.8537 | 0.7607 | 0.6685 | 0.5816 | 0.5022 | 0.4309 | 0.3680 | 0.3130
2.70 | 0.9438 | 0.8640 | 0.7759 | 0.6877 | 0.6036 | 0.5259 | 0.4555 | 0.3927 | 0.3372
2.75 | 0.9482 | 0.8737 | 0.7905 | 0.7063 | 0.6252 | 0.5494 | 0.4801 | 0.4175 | 0.3617
2.80 | 0.9523 | 0.8828 | 0.8045 | 0.7242 | 0.6461 | 0.5725 | 0.5044 | 0.4425 | 0.3867
2.85 | 0.9561 | 0.8915 | 0.8177 | 0.7415 | 0.6665 | 0.5952 | 0.5286 | 0.4675 | 0.4119
2.90 | 0.9597 | 0.8996 | 0.8304 | 0.7580 | 0.6863 | 0.6174 | 0.5525 | 0.4923 | 0.4372
2.95 | 0.9630 | 0.9073 | 0.8424 | 0.7739 | 0.7055 | 0.6390 | 0.5760 | 0.5171 | 0.4625
3.00 | 0.9661 | 0.9145 | 0.8537 | 0.7891 | 0.7239 | 0.6601 | 0.5991 | 0.5415 | 0.4878
3.05 | 0.9690 | 0.9212 | 0.8645 | 0.8036 | 0.7416 | 0.6806 | 0.6216 | 0.5656 | 0.5129
3.10 | 0.9716 | 0.9275 | 0.8746 | 0.8174 | 0.7587 | 0.7003 | 0.6436 | 0.5892 | 0.5378
3.15 | 0.9741 | 0.9334 | 0.8842 | 0.8305 | 0.7750 | 0.7194 | 0.6649 | 0.6124 | 0.5623
3.20 | 0.9763 | 0.9388 | 0.8931 | 0.8429 | 0.7905 | 0.7377 | 0.6856 | 0.6350 | 0.5864
3.25 | 0.9784 | 0.9439 | 0.9016 | 0.8546 | 0.8053 | 0.7553 | 0.7055 | 0.6569 | 0.6099
3.30 | 0.9804 | 0.9487 | 0.9095 | 0.8657 | 0.8194 | 0.7721 | 0.7248 | 0.6782 | 0.6329
3.35 | 0.9822 | 0.9531 | 0.9168 | 0.8761 | 0.8327 | 0.7881 | 0.7432 | 0.6988 | 0.6553
3.40 | 0.9838 | 0.9572 | 0.9237 | 0.8859 | 0.8454 | 0.8034 | 0.7609 | 0.7186 | 0.6769
3.45 | 0.9853 | 0.9609 | 0.9302 | 0.8951 | 0.8573 | 0.8179 | 0.7778 | 0.7376 | 0.6978
3.50 | 0.9867 | 0.9644 | 0.9361 | 0.9037 | 0.8685 | 0.8316 | 0.7939 | 0.7558 | 0.7180
3.55 | 0.9879 | 0.9677 | 0.9417 | 0.9117 | 0.8790 | 0.8446 | 0.8091 | 0.7732 | 0.7373
3.60 | 0.9891 | 0.9706 | 0.9468 | 0.9192 | 0.8889 | 0.8568 | 0.8236 | 0.7898 | 0.7558
3.65 | 0.9901 | 0.9734 | 0.9516 | 0.9261 | 0.8981 | 0.8683 | 0.8372 | 0.8055 | 0.7735
3.70 | 0.9911 | 0.9759 | 0.9559 | 0.9326 | 0.9067 | 0.8790 | 0.8501 | 0.8204 | 0.7902
3.75 | 0.9920 | 0.9782 | 0.9600 | 0.9386 | 0.9148 | 0.8891 | 0.8622 | 0.8345 | 0.8062
3.80 | 0.9928 | 0.9803 | 0.9637 | 0.9441 | 0.9222 | 0.8985 | 0.8736 | 0.8477 | 0.8212
3.85 | 0.9935 | 0.9822 | 0.9672 | 0.9493 | 0.9291 | 0.9073 | 0.8842 | 0.8602 | 0.8355
3.90 | 0.9942 | 0.9839 | 0.9703 | 0.9540 | 0.9355 | 0.9155 | 0.8941 | 0.8718 | 0.8488
3.95 | 0.9948 | 0.9856 | 0.9732 | 0.9583 | 0.9415 | 0.9230 | 0.9034 | 0.8827 | 0.8614
4.00 | 0.9953 | 0.9870 | 0.9758 | 0.9623 | 0.9469 | 0.9300 | 0.9120 | 0.8929 | 0.8731
4.05 | 0.9958 | 0.9883 | 0.9782 | 0.9660 | 0.9519 | 0.9365 | 0.9199 | 0.9024 | 0.8841
4.10 | 0.9963 | 0.9895 | 0.9804 | 0.9693 | 0.9566 | 0.9425 | 0.9273 | 0.9112 | 0.8943
4.15 | 0.9967 | 0.9906 | 0.9824 | 0.9724 | 0.9608 | 0.9480 | 0.9341 | 0.9193 | 0.9038
4.20 | 0.9970 | 0.9916 | 0.9842 | 0.9752 | 0.9647 | 0.9530 | 0.9404 | 0.9269 | 0.9126
4.25 | 0.9974 | 0.9925 | 0.9859 | 0.9777 | 0.9682 | 0.9576 | 0.9461 | 0.9338 | 0.9208
4.30 | 0.9976 | 0.9933 | 0.9874 | 0.9800 | 0.9715 | 0.9619 | 0.9514 | 0.9402 | 0.9283
4.35 | 0.9979 | 0.9941 | 0.9887 | 0.9821 | 0.9744 | 0.9657 | 0.9562 | 0.9460 | 0.9352
4.40 | 0.9981 | 0.9947 | 0.9899 | 0.9840 | 0.9771 | 0.9692 | 0.9607 | 0.9514 | 0.9416
4.45 | 0.9984 | 0.9953 | 0.9910 | 0.9857 | 0.9795 | 0.9724 | 0.9647 | 0.9563 | 0.9474
4.50 | 0.9985 | 0.9958 | 0.9920 | 0.9873 | 0.9817 | 0.9754 | 0.9684 | 0.9608 | 0.9527
4.55 | 0.9987 | 0.9963 | 0.9929 | 0.9887 | 0.9837 | 0.9780 | 0.9717 | 0.9649 | 0.9575
4.60 | 0.9989 | 0.9967 | 0.9937 | 0.9899 | 0.9855 | 0.9804 | 0.9747 | 0.9686 | 0.9620
4.65 | 0.9990 | 0.9971 | 0.9944 | 0.9911 | 0.9871 | 0.9825 | 0.9775 | 0.9719 | 0.9660
4.70 | 0.9991 | 0.9974 | 0.9951 | 0.9921 | 0.9885 | 0.9845 | 0.9799 | 0.9750 | 0.9696
4.75 | 0.9992 | 0.9977 | 0.9956 | 0.9930 | 0.9898 | 0.9862 | 0.9822 | 0.9777 | 0.9729
4.80 | 0.9993 | 0.9980 | 0.9962 | 0.9938 | 0.9910 | 0.9878 | 0.9842 | 0.9802 | 0.9759
4.85 | 0.9994 | 0.9983 | 0.9966 | 0.9945 | 0.9920 | 0.9892 | 0.9860 | 0.9824 | 0.9786
4.90 | 0.9995 | 0.9985 | 0.9970 | 0.9952 | 0.9930 | 0.9904 | 0.9876 | 0.9844 | 0.9810
4.95 | 0.9995 | 0.9987 | 0.9974 | 0.9958 | 0.9938 | 0.9916 | 0.9890 | 0.9862 | 0.9832
5.00 | 0.9996 | 0.9988 | 0.9977 | 0.9963 | 0.9946 | 0.9926 | 0.9903 | 0.9878 | 0.9851





Table 1 (cont.). Probability integral of the range W in normal samples of size n

 W   | n = 11 |   12   |   13   |   14   |   15   |   16   |   17   |   18   |   19   |   20
2.50 | 0.2007 | 0.1644 | 0.1342 | 0.1094 | 0.0890 | 0.0722 | 0.0586 | 0.0474 | 0.0383 | 0.0309
2.55 | 0.2213 | 0.1833 | 0.1514 | 0.1247 | 0.1026 | 0.0842 | 0.0690 | 0.0565 | 0.0462 | 0.0377
2.60 | 0.2429 | 0.2033 | 0.1697 | 0.1413 | 0.1174 | 0.0974 | 0.0807 | 0.0668 | 0.0552 | 0.0455
2.65 | 0.2653 | 0.2243 | 0.1891 | 0.1591 | 0.1336 | 0.1120 | 0.0937 | 0.0783 | 0.0654 | 0.0545
2.70 | 0.2885 | 0.2462 | 0.2096 | 0.1780 | 0.1509 | 0.1278 | 0.1080 | 0.0911 | 0.0768 | 0.0647
2.75 | 0.3124 | 0.2690 | 0.2311 | 0.1981 | 0.1696 | 0.1449 | 0.1236 | 0.1053 | 0.0896 | 0.0761
2.80 | 0.3368 | 0.2926 | 0.2536 | 0.2194 | 0.1894 | 0.1632 | 0.1405 | 0.1208 | 0.1037 | 0.0889
2.85 | 0.3617 | 0.3169 | 0.2770 | 0.2416 | 0.2103 | 0.1829 | 0.1587 | 0.1376 | 0.1192 | 0.1031
2.90 | 0.3870 | 0.3417 | 0.3011 | 0.2647 | 0.2324 | 0.2036 | 0.1782 | 0.1558 | 0.1360 | 0.1186
2.95 | 0.4126 | 0.3670 | 0.3258 | 0.2887 | 0.2554 | 0.2255 | 0.1989 | 0.1752 | 0.1542 | 0.1355
3.00 | 0.4382 | 0.3927 | 0.3512 | 0.3134 | 0.2792 | 0.2484 | 0.2207 | 0.1959 | 0.1737 | 0.1538
3.05 | 0.4639 | 0.4186 | 0.3769 | 0.3387 | 0.3039 | 0.2723 | 0.2436 | 0.2178 | 0.1944 | 0.1734
3.10 | 0.4895 | 0.4446 | 0.4029 | 0.3645 | 0.3292 | 0.2970 | 0.2675 | 0.2407 | 0.2164 | 0.1943
3.15 | 0.5150 | 0.4706 | 0.4292 | 0.3907 | 0.3551 | 0.3224 | 0.2923 | 0.2647 | 0.2394 | 0.2164
3.20 | 0.5401 | 0.4965 | 0.4555 | 0.4171 | 0.3814 | 0.3483 | 0.3177 | 0.2895 | 0.2635 | 0.2396
3.25 | 0.5649 | 0.5222 | 0.4817 | 0.4437 | 0.4081 | 0.3748 | 0.3438 | 0.3151 | 0.2885 | 0.2638
3.30 | 0.5893 | 0.5475 | 0.5078 | 0.4703 | 0.4348 | 0.4016 | 0.3704 | 0.3413 | 0.3142 | 0.2890
3.35 | 0.6131 | 0.5725 | 0.5337 | 0.4967 | 0.4617 | 0.4286 | 0.3974 | 0.3681 | 0.3407 | 0.3150
3.40 | 0.6363 | 0.5970 | 0.5592 | 0.5230 | 0.4885 | 0.4557 | 0.4246 | 0.3953 | 0.3677 | 0.3417
3.45 | 0.6589 | 0.6209 | 0.5842 | 0.5489 | 0.5151 | 0.4827 | 0.4519 | 0.4227 | 0.3950 | 0.3689
3.50 | 0.6807 | 0.6442 | 0.6087 | 0.5744 | 0.5413 | 0.5096 | 0.4792 | 0.4502 | 0.4226 | 0.3964
3.55 | 0.7017 | 0.6668 | 0.6326 | 0.5994 | 0.5672 | 0.5362 | 0.5063 | 0.4777 | 0.4504 | 0.4242
3.60 | 0.7220 | 0.6886 | 0.6558 | 0.6237 | 0.5926 | 0.5624 | 0.5332 | 0.5051 | 0.4781 | 0.4522
3.65 | 0.7414 | 0.7096 | 0.6782 | 0.6474 | 0.6173 | 0.5881 | 0.5596 | 0.5321 | 0.5056 | 0.4801
3.70 | 0.7600 | 0.7298 | 0.6998 | 0.6704 | 0.6414 | 0.6132 | 0.5856 | 0.5588 | 0.5329 | 0.5078
3.75 | 0.7776 | 0.7491 | 0.7206 | 0.6925 | 0.6648 | 0.6376 | 0.6110 | 0.5850 | 0.5598 | 0.5352
3.80 | 0.7944 | 0.7675 | 0.7406 | 0.7138 | 0.6873 | 0.6613 | 0.6357 | 0.6106 | 0.5861 | 0.5622
3.85 | 0.8103 | 0.7850 | 0.7596 | 0.7342 | 0.7090 | 0.6841 | 0.6596 | 0.6355 | 0.6118 | 0.5887
3.90 | 0.8254 | 0.8016 | 0.7777 | 0.7537 | 0.7298 | 0.7061 | 0.6827 | 0.6596 | 0.6369 | 0.6145
3.95 | 0.8395 | 0.8173 | 0.7948 | 0.7723 | 0.7497 | 0.7273 | 0.7050 | 0.6829 | 0.6611 | 0.6397
4.00 | 0.8528 | 0.8321 | 0.8111 | 0.7899 | 0.7686 | 0.7474 | 0.7263 | 0.7053 | 0.6845 | 0.6640
4.05 | 0.8653 | 0.8460 | 0.8264 | 0.8065 | 0.7866 | 0.7666 | 0.7466 | 0.7268 | 0.7070 | 0.6874
4.10 | 0.8769 | 0.8590 | 0.8408 | 0.8223 | 0.8036 | 0.7848 | 0.7660 | 0.7472 | 0.7285 | 0.7099
4.15 | 0.8878 | 0.8712 | 0.8543 | 0.8371 | 0.8196 | 0.8021 | 0.7844 | 0.7667 | 0.7491 | 0.7315
4.20 | 0.8978 | 0.8826 | 0.8669 | 0.8509 | 0.8347 | 0.8183 | 0.8018 | 0.7852 | 0.7686 | 0.7520
4.25 | 0.9072 | 0.8931 | 0.8787 | 0.8639 | 0.8488 | 0.8336 | 0.8182 | 0.8027 | 0.7871 | 0.7715
4.30 | 0.9159 | 0.9029 | 0.8896 | 0.8760 | 0.8620 | 0.8479 | 0.8336 | 0.8191 | 0.8046 | 0.7899
4.35 | 0.9238 | 0.9120 | 0.8998 | 0.8872 | 0.8744 | 0.8613 | 0.8480 | 0.8345 | 0.8210 | 0.8073
4.40 | 0.9312 | 0.9204 | 0.9092 | 0.8976 | 0.8858 | 0.8737 | 0.8614 | 0.8490 | 0.8364 | 0.8237
4.45 | 0.9379 | 0.9281 | 0.9178 | 0.9073 | 0.8964 | 0.8853 | 0.8740 | 0.8625 | 0.8508 | 0.8391
4.50 | 0.9441 | 0.9352 | 0.9258 | 0.9162 | 0.9062 | 0.8960 | 0.8856 | 0.8750 | 0.8643 | 0.8534
4.55 | 0.9498 | 0.9417 | 0.9332 | 0.9244 | 0.9153 | 0.9060 | 0.8964 | 0.8867 | 0.8768 | 0.8667
4.60 | 0.9550 | 0.9475 | 0.9399 | 0.9319 | 0.9236 | 0.9151 | 0.9064 | 0.8975 | 0.8884 | 0.8791
4.65 | 0.9597 | 0.9530 | 0.9460 | 0.9388 | 0.9313 | 0.9235 | 0.9155 | 0.9074 | 0.8991 | 0.8906
4.70 | 0.9640 | 0.9579 | 0.9516 | 0.9451 | 0.9383 | 0.9312 | 0.9240 | 0.9165 | 0.9090 | 0.9012
4.75 | 0.9678 | 0.9624 | 0.9567 | 0.9508 | 0.9446 | 0.9383 | 0.9317 | 0.9249 | 0.9180 | 0.9110
4.80 | 0.9713 | 0.9665 | 0.9614 | 0.9560 | 0.9505 | 0.9447 | 0.9387 | 0.9326 | 0.9264 | 0.9199
4.85 | 0.9745 | 0.9702 | 0.9656 | 0.9608 | 0.9558 | 0.9505 | 0.9452 | 0.9396 | 0.9340 | 0.9281
4.90 | 0.9774 | 0.9735 | 0.9694 | 0.9650 | 0.9605 | 0.9559 | 0.9510 | 0.9460 | 0.9409 | 0.9356
4.95 | 0.9799 | 0.9765 | 0.9728 | 0.9689 | 0.9649 | 0.9607 | 0.9563 | 0.9518 | 0.9472 | 0.9424
5.00 | 0.9822 | 0.9791 | 0.9759 | 0.9724 | 0.9688 | 0.9650 | 0.9611 | 0.9571 | 0.9529 | 0.9486








Table 1 (cont.). Probability integral of the range W in normal samples of size n

 W   | n = 2  |   3    |   4    |   5    |   6    |   7    |   8    |   9    |   10
5.00 | 0.9996 | 0.9988 | 0.9977 | 0.9963 | 0.9946 | 0.9926 | 0.9903 | 0.9878 | 0.9851
5.05 | 0.9996 | 0.9990 | 0.9980 | 0.9967 | 0.9952 | 0.9935 | 0.9915 | 0.9893 | 0.9869
5.10 | 0.9997 | 0.9991 | 0.9982 | 0.9971 | 0.9958 | 0.9942 | 0.9925 | 0.9906 | 0.9884
5.15 | 0.9997 | 0.9992 | 0.9985 | 0.9975 | 0.9963 | 0.9950 | 0.9934 | 0.9917 | 0.9898
5.20 | 0.9998 | 0.9993 | 0.9986 | 0.9978 | 0.9968 | 0.9956 | 0.9942 | 0.9927 | 0.9911
5.25 | 0.9998 | 0.9994 | 0.9988 | 0.9981 | 0.9972 | 0.9961 | 0.9949 | 0.9936 | 0.9922
5.30 | 0.9998 | 0.9995 | 0.9990 | 0.9983 | 0.9975 | 0.9966 | 0.9956 | 0.9944 | 0.9931
5.35 | 0.9998 | 0.9995 | 0.9991 | 0.9985 | 0.9979 | 0.9971 | 0.9961 | 0.9951 | 0.9940
5.40 | 0.9999 | 0.9996 | 0.9992 | 0.9987 | 0.9981 | 0.9974 | 0.9966 | 0.9957 | 0.9948
5.45 | 0.9999 | 0.9997 | 0.9993 | 0.9989 | 0.9984 | 0.9978 | 0.9971 | 0.9963 | 0.9954
5.50 | 0.9999 | 0.9997 | 0.9994 | 0.9991 | 0.9986 | 0.9981 | 0.9975 | 0.9968 | 0.9960
5.55 | 0.9999 | 0.9997 | 0.9995 | 0.9992 | 0.9988 | 0.9983 | 0.9978 | 0.9972 | 0.9965
5.60 | 0.9999 | 0.9998 | 0.9996 | 0.9993 | 0.9989 | 0.9985 | 0.9981 | 0.9976 | 0.9970
5.65 | 0.9999 | 0.9998 | 0.9996 | 0.9994 | 0.9991 | 0.9987 | 0.9983 | 0.9979 | 0.9974
5.70 | 0.9999 | 0.9998 | 0.9997 | 0.9995 | 0.9992 | 0.9989 | 0.9986 | 0.9982 | 0.9977
5.75 | 1.0000 | 0.9999 | 0.9997 | 0.9995 | 0.9993 | 0.9991 | 0.9988 | 0.9984 | 0.9981
5.80 |        | 0.9999 | 0.9998 | 0.9996 | 0.9994 | 0.9992 | 0.9989 | 0.9986 | 0.9983
5.85 |        | 0.9999 | 0.9998 | 0.9997 | 0.9995 | 0.9993 | 0.9991 | 0.9988 | 0.9986
5.90 |        | 0.9999 | 0.9998 | 0.9997 | 0.9996 | 0.9994 | 0.9992 | 0.9990 | 0.9988
5.95 |        | 0.9999 | 0.9998 | 0.9998 | 0.9996 | 0.9995 | 0.9993 | 0.9991 | 0.9989
6.00 |        | 0.9999 | 0.9999 | 0.9998 | 0.9997 | 0.9996 | 0.9994 | 0.9993 | 0.9991
6.05 |        | 0.9999 | 0.9999 | 0.9998 | 0.9997 | 0.9996 | 0.9995 | 0.9994 | 0.9992
6.10 |        | 0.9999 | 0.9999 | 0.9998 | 0.9998 | 0.9997 | 0.9996 | 0.9995 | 0.9993
6.15 |        | 1.0000 | 0.9999 | 0.9999 | 0.9998 | 0.9997 | 0.9996 | 0.9995 | 0.9994
6.20 |        |        | 0.9999 | 0.9999 | 0.9998 | 0.9998 | 0.9997 | 0.9996 | 0.9995
6.25 |        |        | 0.9999 | 0.9999 | 0.9999 | 0.9998 | 0.9997 | 0.9997 | 0.9996
6.30 |        |        | 0.9999 | 0.9999 | 0.9999 | 0.9998 | 0.9998 | 0.9997 | 0.9996
6.35 |        |        | 1.0000 | 0.9999 | 0.9999 | 0.9999 | 0.9998 | 0.9998 | 0.9997
6.40 |        |        |        | 0.9999 | 0.9999 | 0.9999 | 0.9998 | 0.9998 | 0.9997
6.45 |        |        |        | 1.0000 | 0.9999 | 0.9999 | 0.9999 | 0.9998 | 0.9998
6.50 |        |        |        |        | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9998
6.55 |        |        |        |        | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9998
6.60 |        |        |        |        | 1.0000 | 0.9999 | 0.9999 | 0.9999 | 0.9999
6.65 |        |        |        |        |        | 0.9999 | 0.9999 | 0.9999 | 0.9999
6.70 |        |        |        |        |        | 1.0000 | 0.9999 | 0.9999 | 0.9999
6.75 |        |        |        |        |        |        | 1.0000 | 0.9999 | 0.9999
6.80 |        |        |        |        |        |        |        | 0.9999 | 0.9999
6.85 |        |        |        |        |        |        |        | 1.0000 | 0.9999
6.90 |        |        |        |        |        |        |        |        | 0.9999
6.95 |        |        |        |        |        |        |        |        | 1.0000
7.00 |        |        |        |        |        |        |        |        |
7.05 |        |        |        |        |        |        |        |        |
7.10 |        |        |        |        |        |        |        |        |
7.15 |        |        |        |        |        |        |        |        |
7.20 |        |        |        |        |        |        |        |        |
7.25 |        |        |        |        |        |        |        |        |














Table 1 (cont.). Probability integral of the range W in normal samples of size n

 W   | n = 11 |   12   |   13   |   14   |   15   |   16   |   17   |   18   |   19   |   20
5.00 | 0.9822 | 0.9791 | 0.9759 | 0.9724 | 0.9688 | 0.9650 | 0.9611 | 0.9571 | 0.9529 | 0.9486
5.05 | 0.9843 | 0.9815 | 0.9786 | 0.9756 | 0.9723 | 0.9690 | 0.9655 | 0.9618 | 0.9581 | 0.9543
5.10 | 0.9861 | 0.9837 | 0.9811 | 0.9784 | 0.9755 | 0.9725 | 0.9694 | 0.9661 | 0.9628 | 0.9593
5.15 | 0.9878 | 0.9856 | 0.9833 | 0.9809 | 0.9783 | 0.9757 | 0.9729 | 0.9700 | 0.9670 | 0.9639
5.20 | 0.9893 | 0.9874 | 0.9853 | 0.9832 | 0.9809 | 0.9785 | 0.9760 | 0.9735 | 0.9708 | 0.9681
5.25 | 0.9906 | 0.9889 | 0.9871 | 0.9852 | 0.9832 | 0.9811 | 0.9789 | 0.9766 | 0.9742 | 0.9718
5.30 | 0.9917 | 0.9903 | 0.9887 | 0.9870 | 0.9852 | 0.9833 | 0.9814 | 0.9794 | 0.9773 | 0.9751
5.35 | 0.9928 | 0.9915 | 0.9901 | 0.9886 | 0.9870 | 0.9854 | 0.9836 | 0.9819 | 0.9800 | 0.9781
5.40 | 0.9937 | 0.9925 | 0.9913 | 0.9900 | 0.9886 | 0.9872 | 0.9856 | 0.9841 | 0.9824 | 0.9807
5.45 | 0.9945 | 0.9935 | 0.9924 | 0.9912 | 0.9900 | 0.9888 | 0.9874 | 0.9860 | 0.9846 | 0.9831
5.50 | 0.9952 | 0.9943 | 0.9934 | 0.9924 | 0.9913 | 0.9902 | 0.9890 | 0.9878 | 0.9865 | 0.9852
5.55 | 0.9958 | 0.9951 | 0.9942 | 0.9933 | 0.9924 | 0.9914 | 0.9904 | 0.9893 | 0.9882 | 0.9870
5.60 | 0.9964 | 0.9957 | 0.9950 | 0.9942 | 0.9934 | 0.9925 | 0.9916 | 0.9907 | 0.9897 | 0.9887
5.65 | 0.9969 | 0.9963 | 0.9956 | 0.9950 | 0.9943 | 0.9935 | 0.9927 | 0.9919 | 0.9910 | 0.9901
5.70 | 0.9973 | 0.9968 | 0.9962 | 0.9956 | 0.9950 | 0.9944 | 0.9937 | 0.9929 | 0.9922 | 0.9914
5.75 | 0.9976 | 0.9972 | 0.9967 | 0.9962 | 0.9957 | 0.9951 | 0.9945 | 0.9939 | 0.9932 | 0.9925
5.80 | 0.9980 | 0.9976 | 0.9972 | 0.9967 | 0.9963 | 0.9958 | 0.9952 | 0.9947 | 0.9941 | 0.9935
5.85 | 0.9982 | 0.9979 | 0.9976 | 0.9972 | 0.9968 | 0.9963 | 0.9959 | 0.9954 | 0.9949 | 0.9944
5.90 | 0.9985 | 0.9982 | 0.9979 | 0.9976 | 0.9972 | 0.9968 | 0.9964 | 0.9960 | 0.9956 | 0.9952
5.95 | 0.9987 | 0.9985 | 0.9982 | 0.9979 | 0.9976 | 0.9973 | 0.9969 | 0.9966 | 0.9962 | 0.9958
6.00 | 0.9989 | 0.9987 | 0.9984 | 0.9982 | 0.9979 | 0.9977 | 0.9974 | 0.9971 | 0.9967 | 0.9964
6.05 | 0.9990 | 0.9989 | 0.9987 | 0.9984 | 0.9982 | 0.9980 | 0.9977 | 0.9975 | 0.9972 | 0.9969
6.10 | 0.9992 | 0.9990 | 0.9989 | 0.9987 | 0.9985 | 0.9983 | 0.9981 | 0.9978 | 0.9976 | 0.9973
6.15 | 0.9993 | 0.9992 | 0.9990 | 0.9989 | 0.9987 | 0.9985 | 0.9983 | 0.9981 | 0.9979 | 0.9977
6.20 | 0.9994 | 0.9993 | 0.9992 | 0.9990 | 0.9989 | 0.9987 | 0.9986 | 0.9984 | 0.9982 | 0.9980
6.25 | 0.9995 | 0.9994 | 0.9993 | 0.9992 | 0.9991 | 0.9989 | 0.9988 | 0.9986 | 0.9985 | 0.9983
6.30 | 0.9996 | 0.9995 | 0.9994 | 0.9993 | 0.9992 | 0.9991 | 0.9990 | 0.9988 | 0.9987 | 0.9986
6.35 | 0.9996 | 0.9996 | 0.9995 | 0.9994 | 0.9993 | 0.9992 | 0.9991 | 0.9990 | 0.9989 | 0.9988
6.40 | 0.9997 | 0.9996 | 0.9996 | 0.9995 | 0.9994 | 0.9993 | 0.9992 | 0.9992 | 0.9991 | 0.9990
6.45 | 0.9997 | 0.9997 | 0.9996 | 0.9996 | 0.9995 | 0.9994 | 0.9994 | 0.9993 | 0.9992 | 0.9991
6.50 | 0.9998 | 0.9997 | 0.9997 | 0.9996 | 0.9996 | 0.9995 | 0.9995 | 0.9994 | 0.9993 | 0.9993
6.55 | 0.9998 | 0.9998 | 0.9997 | 0.9997 | 0.9996 | 0.9996 | 0.9995 | 0.9995 | 0.9994 | 0.9994
6.60 | 0.9998 | 0.9998 | 0.9998 | 0.9997 | 0.9997 | 0.9997 | 0.9996 | 0.9996 | 0.9995 | 0.9995
6.65 | 0.9999 | 0.9998 | 0.9998 | 0.9998 | 0.9997 | 0.9997 | 0.9997 | 0.9996 | 0.9996 | 0.9995
6.70 | 0.9999 | 0.9999 | 0.9998 | 0.9998 | 0.9998 | 0.9998 | 0.9997 | 0.9997 | 0.9997 | 0.9996
6.75 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9998 | 0.9998 | 0.9998 | 0.9997 | 0.9997 | 0.9997
6.80 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9998 | 0.9998 | 0.9998 | 0.9998 | 0.9998 | 0.9997
6.85 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9998 | 0.9998 | 0.9998 | 0.9998
6.90 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9998 | 0.9998 | 0.9998
6.95 | 1.0000 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9998
7.00 |        | 1.0000 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999
7.05 |        |        | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999
7.10 |        |        | 1.0000 | 1.0000 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999 | 0.9999
7.15 |        |        |        |        | 1.0000 | 1.0000 | 0.9999 | 0.9999 | 0.9999 | 0.9999
7.20 |        |        |        |        |        |        | 1.0000 | 1.0000 | 0.9999 | 0.9999
7.25 |        |        |        |        |        |        |        |        | 1.0000 | 1.0000







3. The origin of the present tables 

Tables giving the expected or mean value and the standard deviation of range in random
samples from the normal population of equation (1) were calculated by L. H. C. Tippett
(1925) in the Department of Applied Statistics, University College, London. Since the
probability distribution $f_n(w)$ is itself far from normal in form, it was evident that its mean
and standard deviation alone would not provide all the information generally needed in
practice. Tippett included in his paper some values of the constants $\beta_1$ and $\beta_2$ of the
distribution and his work was extended by the present writer (Pearson, 1926, 1932) who
developed an approximate method of determining probability levels for w and provided
some provisional tables of these. The need has, however, been felt for some time for a full
and accurate table of the probability integral of the range to fit into place among other
fundamental tables associated with the normal distribution. The completion of this objective
has been made possible by a grant from the Department of Scientific and Industrial Research
whose assistance in the matter is acknowledged with warm appreciation. The actual method
of computation was planned by Dr H. O. Hartley and the calculations were carried out under
his supervision by Scientific Computing Service, Ltd. The scope of the main table was
limited to $n \le 20$. As n increases beyond this value there is an increasing risk that the table
may be misleading in practice, since $f_n(w)$ becomes very sensitive to relatively slight
departures from normality in the tails of the population distribution.

Table 2

Size of  | Factor |        Lower percentage points          |        Upper percentage points
sample n |  a_n   | 0.1  | 0.5  | 1.0  | 2.5  | 5.0  | 10.0 | 10.0 | 5.0  | 2.5  | 1.0  | 0.5  | 0.1
    2    | 0.8862 | 0.00 | 0.01 | 0.02 | 0.04 | 0.09 | 0.18 | 2.33 | 2.77 | 3.17 | 3.64 | 3.97 | 4.65
    3    | 0.5908 | 0.06 | 0.13 | 0.19 | 0.30 | 0.43 | 0.62 | 2.90 | 3.31 | 3.68 | 4.12 | 4.42 | 5.06
    4    | 0.4857 | 0.20 | 0.34 | 0.43 | 0.59 | 0.76 | 0.98 | 3.24 | 3.63 | 3.98 | 4.40 | 4.69 | 5.31
    5    | 0.4299 | 0.37 | 0.55 | 0.66 | 0.85 | 1.03 | 1.26 | 3.48 | 3.86 | 4.20 | 4.60 | 4.89 | 5.48
    6    | 0.3946 | 0.54 | 0.75 | 0.87 | 1.06 | 1.25 | 1.49 | 3.66 | 4.03 | 4.36 | 4.76 | 5.03 | 5.62
    7    | 0.3698 | 0.69 | 0.92 | 1.05 | 1.25 | 1.44 | 1.68 | 3.81 | 4.17 | 4.49 | 4.88 | 5.15 | 5.73
    8    | 0.3512 | 0.83 | 1.08 | 1.20 | 1.41 | 1.60 | 1.83 | 3.93 | 4.29 | 4.61 | 4.99 | 5.26 | 5.82
    9    | 0.3367 | 0.96 | 1.21 | 1.34 | 1.55 | 1.74 | 1.97 | 4.04 | 4.39 | 4.70 | 5.08 | 5.34 | 5.90
   10    | 0.3249 | [illegible] | 1.33 | 1.47 | 1.67 | 1.86 | 2.09 | 4.13 | 4.47 | 4.79 | 5.16 | 5.42 | 5.97
   11    | 0.3152 | 1.20 | 1.45 | 1.58 | 1.78 | 1.97 | 2.20 | 4.21 | 4.55 | 4.86 | 5.23 | 5.49 | 6.04
   12    | 0.3069 | 1.30 | 1.55 | 1.68 | 1.88 | 2.07 | 2.30 | 4.29 | 4.62 | 4.92 | 5.29 | 5.54 | 6.09

Estimate of $\sigma = a_n \times$ range (or mean range) in a sample of n observations.
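The use of the factor $a_n$ of Table 2 can be illustrated by a short sketch. The function name `sigma_from_range` and the dictionary `A_N` are introduced here for illustration only; the factors are those printed in Table 2, and the sample is assumed to come from a normal population with n between 2 and 12.

```python
# Factors a_n copied from Table 2 (sample sizes 2 to 12).
A_N = {2: 0.8862, 3: 0.5908, 4: 0.4857, 5: 0.4299, 6: 0.3946, 7: 0.3698,
       8: 0.3512, 9: 0.3367, 10: 0.3249, 11: 0.3152, 12: 0.3069}


def sigma_from_range(sample):
    """Estimate sigma as a_n times the range of a single sample of n values."""
    n = len(sample)
    if n not in A_N:
        raise ValueError("Table 2 gives a_n only for n = 2, ..., 12")
    return A_N[n] * (max(sample) - min(sample))


# Hypothetical sample of five measurements; the range is 10.4 - 9.6.
estimate = sigma_from_range([10.1, 9.6, 10.4, 9.9, 10.2])
```

For several samples of the same size the mean of their ranges would take the place of the single range, as the caption to Table 2 indicates.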


REFERENCES

Davies, O. L. & Pearson, E. S. J.R. Statist. Soc. Supp. 1, 76.
Dudding & Jennett, W. J. (1942). The Application of Statistical Methods to Quality Control. British Standards Institution, No. 600, (1942).
McKay, A. T. & Pearson, E. S. (1933). Biometrika, 25, 415.
Pearson, E. S. (1926). Biometrika, 18, 173.
Pearson, E. S. (1932). Biometrika, 24, 404.
Pearson, E. S. (1935). The Application of Statistical Methods to Industrial Standardization and Quality Control. British Standards Institution, No. 600, (1935).
Tippett, L. H. C. (1925). Biometrika, 17, 364.


II. NUMERICAL EVALUATION OF THE PROBABILITY INTEGRAL 
By H. O. HARTLEY


The formula used for the tabulation of the probability integral $P_n(W)$ of the range in normal
samples of size n is given in the paper printed on pp. 334-48 below, where it is proved that

$$P_n(W) = \left(\int_{-\frac{1}{2}W}^{+\frac{1}{2}W} z(x)\,dx\right)^{n} + 2n\int_{\frac{1}{2}W}^{\infty} z(u)\left(\int_{u-W}^{u} z(x)\,dx\right)^{n-1} du, \tag{1}$$

where $z(x) = (2\pi)^{-\frac{1}{2}} e^{-\frac{1}{2}x^{2}}$.
Certain properties of this formula and the facilities provided by certain modern calculating
machines make this integral amenable to tabulation.
The main work consists in the evaluation of the integral

$$2n\int_{\frac{1}{2}W}^{\infty} z(u)\left(\int_{u-W}^{u} z(x)\,dx\right)^{n-1} du, \tag{2}$$

by quadrature for a two variable network of values of n and W. The range of integration is
from $\frac{1}{2}W$ up to a point where the integrand

$$z(u)\left(\int_{u-W}^{u} z(x)\,dx\right)^{n-1} \tag{3}$$

vanishes to 7-decimal accuracy.* For each point of the network n, W, therefore, the integrand
(3) was tabulated for a set of equidistant values of u covering the range of integration. The
interval of integration was chosen as $\Delta u = 0.2$ throughout. This was sufficient to obtain
about 5-decimal accuracy in the integral (2).
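The quadrature just described can be sketched on a modern machine as follows. This is only an illustration under stated assumptions: Simpson's rule with a much finer step than the $\Delta u = 0.2$ of the original work replaces the sum with Gregory corrections, the inner integral is taken from the error function instead of printed tables, and the names `z`, `area` and `range_integral` are introduced here.

```python
import math


def z(x):
    """Normal ordinate z(x) = (2*pi)^(-1/2) * exp(-x^2 / 2)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def area(a, b):
    """Integral of z(x) from a to b, obtained from the error function."""
    return 0.5 * (math.erf(b / math.sqrt(2.0)) - math.erf(a / math.sqrt(2.0)))


def range_integral(n, W, du=0.01, span=9.0):
    """P_n(W) from formula (1); the integral (2) is found by Simpson's rule.

    The integration runs from W/2 to W/2 + span, beyond which the
    integrand (3) is negligible.
    """
    steps = int(round(span / du))
    if steps % 2:
        steps += 1                      # Simpson's rule needs an even count
    h = span / steps
    lower = 0.5 * W
    total = 0.0
    for k in range(steps + 1):
        u = lower + k * h
        weight = 1 if k in (0, steps) else (4 if k % 2 else 2)
        total += weight * z(u) * area(u - W, u) ** (n - 1)
    integral_2 = 2 * n * total * h / 3.0
    return area(-0.5 * W, 0.5 * W) ** n + integral_2


p3 = range_integral(3, 2.50)     # Table 1 gives 0.8195
p10 = range_integral(10, 5.00)   # Table 1 gives 0.9851
```

For n = 2 the result can be compared with the closed form $2\cdot\tfrac{1}{2}(1+\alpha) - 1$ at $x = W/\sqrt{2}$, which equals `math.erf(W / 2)`.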

The interval in W was taken as wide as possible but sufficiently fine to permit checking
by differencing and the subsequent subtabulation of $P_n(W)$ to interval 0.05, which is the
interval in the final table. An interval of $\Delta W = 0.25$ was therefore chosen for the n, W
network.

For small values of W it was necessary to tabulate the integrand for all integers n for 
which P,(W) is required in the final table. For larger values of n and W, however, it was 
sufficient to calculate the final integral (1) for odd n and to obtain intermediate values by 
interpolation. Below, then, is shown the two variable network for which the integral (2) 
was produced by quadrature: 


W = 0.00 (0.25) 1.25 and n = 3 (1) 20,
W = 1.50 (0.25) 2.75 and n = 3 (1) 9 (2) 23,
W = 3.00 (0.25) 3.25 and n = 3 (1) 5 (2) 23,          (4)
W = 3.50 (0.25) 8.00 and n = 3 (2) 23.
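The notation a (h) b in (4) denotes the values from a to b at interval h. A small sketch, with the helper name `seq` and the list `NETWORK` introduced here purely for illustration, expands the network and counts its points.

```python
def seq(start, step, stop):
    """Values start, start + step, ..., stop, i.e. 'start (step) stop'."""
    count = int(round((stop - start) / step))
    return [round(start + k * step, 2) for k in range(count + 1)]


# Each entry pairs the W values with the n values of one line of (4).
NETWORK = [
    (seq(0.00, 0.25, 1.25), seq(3, 1, 20)),
    (seq(1.50, 0.25, 2.75), seq(3, 1, 9) + seq(9, 2, 23)[1:]),
    (seq(3.00, 0.25, 3.25), seq(3, 1, 5) + seq(5, 2, 23)[1:]),
    (seq(3.50, 0.25, 8.00), seq(3, 2, 23)),
]

# Number of (n, W) points at which the integral (2) was found by quadrature.
points = sum(len(ws) * len(ns) for ws, ns in NETWORK)
```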


For n = 2 the final integral $P_2(W)$ is given directly by the normal integral and may be
obtained by interpolation in Table II of Tables for Statisticians and Biometricians, Part I.
Using the notation of that table (Sheppard's original notation), in which $\frac{1}{2}(1+\alpha)$ is the
area of the normal curve up to the abscissa x, we have

$$P_2(W) = \alpha \quad \text{for} \quad x = \frac{W}{\sqrt{2}}.$$

Moreover, for purposes of interpolation, use was made of the formal relation

$$P_1(W) = 1 \quad \text{for} \quad W > 0.$$
* The integrand was calculated to 7-decimal accuracy in order to obtain $P_n(W)$ to about
5-decimal accuracy.

For fixed u and W and for values of n in the arithmetic progression (4), the integrand (3)
is a geometric progression with

$$z(u)\left(\int_{u-W}^{u} z(x)\,dx\right)^{2}$$

as leading term and

$$\left(\int_{u-W}^{u} z(x)\,dx\right) \quad \text{or} \quad \left(\int_{u-W}^{u} z(x)\,dx\right)^{2}$$

as common ratio. This leading term as well as the common ratios were easily obtained from
Table II of Tables for Statisticians and Biometricians, Part I and the terms of the progression
were then automatically produced on a Mercedes calculating machine Model 38 M.S. and
copied down in two-way tables with u as row heading, n as column heading and W as
table heading. The values of the integrand were then checked by differencing column-wise
and added to yield the main term of the integral (2). The correction terms which, according
to Gregory's formula, convert the integrand-sum into the integral were calculated from the
differences and checked by the application of Gauss' formula of integration. Finally, to
obtain $P_n(W)$ the term

$$\left(\int_{-\frac{1}{2}W}^{+\frac{1}{2}W} z(x)\,dx\right)^{n}$$

was produced by continued multiplication on the Mercedes and added to the corresponding
integral (2) to yield $P_n(W)$ for all points of the above network.
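The geometric progression in n can be sketched as repeated multiplication, in the spirit of the machine procedure. The function names are introduced here for illustration, and the inner integral is taken from the error function and not from Table II.

```python
import math


def z(x):
    """Normal ordinate."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def area(a, b):
    """Integral of z(x) from a to b."""
    return 0.5 * (math.erf(b / math.sqrt(2.0)) - math.erf(a / math.sqrt(2.0)))


def integrand_progression(u, W, n_values):
    """Integrand (3) for equally spaced n, built by continued multiplication.

    n_values must be an arithmetic progression with at least two terms,
    for example [3, 5, 7, ..., 23].
    """
    ratio = area(u - W, u)
    step = n_values[1] - n_values[0]
    common_ratio = ratio ** step                 # ratio or ratio squared
    term = z(u) * ratio ** (n_values[0] - 1)     # leading term
    terms = []
    for _ in n_values:
        terms.append(term)
        term *= common_ratio
    return terms


odd_n = list(range(3, 24, 2))
column = integrand_progression(u=2.0, W=3.5, n_values=odd_n)
```

Each entry of `column` agrees with the direct expression `z(u) * area(u - W, u) ** (n - 1)` up to rounding error.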
For odd values of n the integral $P_n(W)$ was then differenced W-wise on the National
machine which, incidentally, produced column totals $\sum_W P_n(W)$ for these values of n. Two
checks were applied at this stage. One consisted in inspecting the fourth order differences.
As a second check, the mean range, $\bar{w}_n$, was calculated from the formula*

$$\bar{w}_n = 8 - \int_0^8 P_n(w)\,dw,$$

and compared with the correct mean range given in Table XXII of Tables for Statisticians
and Biometricians, Part II. Finally, the function $P_n(W)$ was subtabulated to interval 0.05
on the National machine by a method similar to that described in detail by L. J. Comrie
(1936).
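The second check can be illustrated for n = 2, where the probability integral has the closed form `math.erf(w / 2)` and the mean range is $2/\sqrt{\pi}$. The sketch below uses Simpson's rule at the tabular interval 0.05; the function names are introduced here and the choice of quadrature rule is an assumption, not the method of the original computation.

```python
import math


def mean_range_check(P, upper=8.0, dw=0.05):
    """Mean range as upper - integral of P(w) from 0 to upper (Simpson's rule).

    Valid provided P(upper) = 1 to the accuracy required.
    """
    steps = int(round(upper / dw))
    if steps % 2:
        steps += 1
    h = upper / steps
    total = 0.0
    for k in range(steps + 1):
        weight = 1 if k in (0, steps) else (4 if k % 2 else 2)
        total += weight * P(k * h)
    return upper - total * h / 3.0


def P2(w):
    """Probability integral of the range for n = 2."""
    return math.erf(0.5 * w)


mean_range_2 = mean_range_check(P2)
```

The reciprocal of `mean_range_2` reproduces the factor 0.8862 given for n = 2 in Table 2.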


The values of $P_n(W)$ for even n were then obtained by interpolation with the help of
two interpolation formulae of Lagrangian type:

$$2048\,P_n(W) = -5[P_{n-7}(W) + P_{n+7}(W)] + 49[P_{n-5}(W) + P_{n+5}(W)] - 245[P_{n-3}(W) + P_{n+3}(W)] + 1225[P_{n-1}(W) + P_{n+1}(W)], \tag{5}$$

$$20\,P_n(W) = P_{n-3}(W) + P_{n+3}(W) - 6[P_{n-2}(W) + P_{n+2}(W)] + 15[P_{n-1}(W) + P_{n+1}(W)]. \tag{6}$$

Formula (5) yields the interpolate for even n from the given values of $P_n(W)$ at adjacent odd
values of n. This formula was used throughout. In some cases, however, the resulting
interpolate was accurate to about 3 places of decimals only. In such cases values of $P_{n-3}$, $P_{n+3}$,
$P_{n-1}$, $P_{n+1}$, accurate to 5 places of decimals and values of $P_{n-2}$, $P_{n+2}$ accurate to (say)
3 places of decimals were substituted in formula (6). This yielded a 'corrected value' of
$P_n(W)$. The process was then repeated with n replaced by n + 2 and so on until all values of
$P_n(W)$ had 'settled down' for even values of n. It is easy to see that the process is convergent
and that the maximum error in the interpolate is 2 units in the 5th decimal.
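Formulae (5) and (6) can be written down directly. In this sketch the tabulated values for one fixed W are held in a dictionary `P` keyed by n; the function names and the cubic used as a check are illustrative assumptions. Formula (5) is exact for polynomials in n up to the seventh degree and formula (6) up to the fifth, so a cubic is reproduced exactly.

```python
def interpolate_formula_5(P, n):
    """Value at even n from the eight odd neighbours n - 7, ..., n + 7."""
    return (-5 * (P[n - 7] + P[n + 7]) + 49 * (P[n - 5] + P[n + 5])
            - 245 * (P[n - 3] + P[n + 3]) + 1225 * (P[n - 1] + P[n + 1])) / 2048


def correct_formula_6(P, n):
    """'Corrected value' at n from the neighbours n - 3, ..., n + 3."""
    return (P[n - 3] + P[n + 3] - 6 * (P[n - 2] + P[n + 2])
            + 15 * (P[n - 1] + P[n + 1])) / 20


# Illustrative check on a cubic in n, not on tabulated probabilities.
cubic = {k: k ** 3 for k in range(1, 24)}
by_5 = interpolate_formula_5(cubic, 10)
by_6 = correct_formula_6(cubic, 10)
```

In the procedure of the text, formula (6) would be applied with the 5-decimal odd-n values and the 3-decimal even-n interpolates, and repeated until the even-n values no longer change.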

After completion of the interpolation n-wise, the interpolates P,(W) for even n were 
differenced W-wise, checked and subtabulated as for odd values of n. 


* This is true provided $P_n(8) = 1$ to 6-decimal place accuracy.


REFERENCE 
Comrie, L. J. (1936). J.R. Statist. Soc. Suppl. 3, 87-114.



















NOTES ON TESTING STATISTICAL HYPOTHESES 
By E. S. PEARSON


1. In July 1939, a few weeks before the opening of the present war, a Con- 
ference on the Application of the Calculus of Probabilities was held at Geneva 
under the auspices of the International Institute of Intellectual Co-operation 
(League of Nations). At the public session at which a paper by Prof. J. Neyman 
was presented and also subsequently in some informal discussions, a number 
of questions were raised: 

(a) In choosing a test for a statistical hypothesis, is it possible or even necessary
to specify the hypotheses alternative to that tested? Why should not a test be
made to depend only on the form of law associated with the hypothesis tested?
For example, Newton’s hypothesis of gravitation was formulated and tested 
without any need to define alternative laws. 

(b) Is the method of approach to these problems advocated by Prof. Neyman 
and myself applicable to testing the appropriateness of probability laws or only 
to testing hypotheses regarding the numerical values of constants contained in 
these laws? 

After the conclusion of the conference, I set down some Notes for a few of the 
statisticians who had taken part in the discussions, hoping that at leisure they 
might feel stimulated to define their views on the subject more precisely. But 
almost before the Notes were despatched, war in Europe had intervened. The 
only reply which I received was from Prof. Gumbel, and this, after some un- 
avoidable delay, has now taken shape in the contribution printed on pp. 317-33 
below. In publishing this, it seems useful to add my own Notes, which are given 
with only minor verbal alterations in the following pages. They are in part a 
restatement with rather different emphasis of views expressed in a paper published 
four years ago (Pearson, 1938). 

2. With regard to one of the points raised under (a) above, it should be 
remembered that a statistical hypothesis as defined by Neyman and myself is 
a hypothesis concerning the probability law of random variables. The gravi- 
tational hypothesis of Newton is not a statistical hypothesis in the sense defined; 
statistical methods may be introduced to test the Newtonian hypothesis, how- 
ever, and they will involve tests of statistical hypotheses or ‘significance tests’ 
because it will be assumed that errors of observation exist which may be regarded 
as random variables, probably taken to follow the normal distribution law. 

For example, on the Newtonian hypothesis, the angular co-ordinates of a
planet measured from the earth as origin may at certain moments be given as
$\xi = \xi_t$, $\eta = \eta_t$ ($t = 1, 2, \ldots$). If we have a number of observations of position $x_t$, $y_t$,
subject to observational error, the statistical problem will be to test whether
these are consistent with the hypothetical position values $\xi_t$, $\eta_t$, or whether they
suggest that $\xi$, $\eta$ have some other different values at the moments of observation.
Thus the 'alternatives' that we have immediately in mind will be alternative
values for $\xi$, $\eta$, not alternative gravitational hypotheses. If, however, some
alternative law of motion were proposed, so that we could specify definite values
$\xi'_t$, $\eta'_t$ alternative to the values $\xi_t$, $\eta_t$ of the Newtonian law, then undoubtedly we
could choose a statistical test which would be particularly efficient in
discriminating between the two alternatives. Such a course became possible when the
Einstein hypothesis was formulated and the orbit of Mercury considered. But
the absence of an alternative gravitational law does not prevent us selecting a
statistical test which will be (a) sensitive to departures in $\xi$, $\eta$ from $\xi_t$, $\eta_t$, but
(b) relatively insensitive to departures from normality in the distribution of
errors. We should make this selection because, if the Newtonian law were incorrect,
we believe that this would result in a change in $\xi_t$, $\eta_t$ but not in a departure of the
distribution of observational errors from the normal law.

This example, of course, concerns a statistical hypothesis regarding the values
of two parameters $\xi$, $\eta$, not regarding the form of a probability law of random
variables. The following general approach shows, however, that the principles
discussed may be applied to testing hypotheses regarding probability laws.

3. Suppose that x is a continuous random variable and that $H_0$ is a statistical
hypothesis which assumes that the elementary probability law for x is $p(x \mid H_0)$
in the interval $-\infty$ to $+\infty$. Thus

$$\int_{-\infty}^{+\infty} p(x \mid H_0)\,dx = 1. \tag{1}$$

Now write

$$y = \int_{-\infty}^{x} p(x \mid H_0)\,dx. \tag{2}$$

y will be a non-decreasing function of x having values confined to the interval
(0, 1). Further, the elementary probability law of y will be

$$p(y) = p(x \mid H_0)\,\frac{dx}{dy} = 1 \quad \text{for} \quad 0 < y < 1, \tag{3}$$

or all values of y between 0 and 1 are 'equally probable'.

Suppose now that we wish to use a set of n independent values $x_1, x_2, \ldots, x_n$
to test that the probability law is of the assumed form $p(x \mid H_0)$. It is clear that
the hypothesis $H_0$ is exactly equivalent to the hypothesis, say $h_0$, that the n
values $y_1, y_2, \ldots, y_n$ (obtained from the x's by the transformation (2)) have been
sampled subject to the probability law (3). Just as the point $(x_1, x_2, \ldots, x_n)$ may
be represented in an unlimited n-dimensioned space having probability density

$$p(x_1, x_2, \ldots, x_n \mid H_0) = \prod_{i=1}^{n} \{p(x_i \mid H_0)\}, \tag{4}$$

if $H_0$ is true, so the point $(y_1, y_2, \ldots, y_n)$ may be placed in an n-dimensioned
hypercube with sides of unit length and with uniform probability density, if $H_0$ and
therefore $h_0$ is true. It follows that if $H_0$ is what has been termed a 'simple
hypothesis', i.e. specifies the form of $p(x \mid H_0)$ completely,* then the test of $H_0$ may
always be transformed to the test of $h_0$. If then it were correct to say that the test
of a statistical hypothesis depends only on the form of the law specified by $H_0$, it
follows that for the type of situation considered the testing of a statistical
hypothesis could always be reduced to the following simple problem:

To test whether a sample of n independent random variables $y_1, y_2, \ldots, y_n$
($0 < y_i < 1$) has been selected from the so-called rectangular distribution, i.e. the
distribution for which $p(y) = 1$, ($0 < y < 1$).
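The transformation (2) can be sketched numerically. The example assumes, purely for illustration, that $H_0$ specifies the normal law with zero mean and unit standard deviation; the function name and the simulated sample are not part of the original Notes.

```python
import random
from statistics import NormalDist


def probability_integral_transform(xs, h0):
    """Map each x to y, the integral of p(x | H0) from -infinity to x."""
    return [h0.cdf(x) for x in xs]


random.seed(1)
h0 = NormalDist(mu=0.0, sigma=1.0)                   # assumed form of H0
sample = [random.gauss(0.0, 1.0) for _ in range(10000)]
ys = probability_integral_transform(sample, h0)

# Under H0 the y's should spread evenly over (0, 1): count them by tenths.
tenths = [sum(1 for y in ys if k / 10 <= y < (k + 1) / 10) for k in range(10)]
```

With $H_0$ true, each entry of `tenths` is close to one tenth of the sample size, which is the rectangular law (3) in empirical form.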

4. We are at once faced, therefore, with the question of how to test this
simple but apparently fundamental hypothesis. If $h_0$ is true, the sample point
is equally likely to fall at any point within the n-dimensioned hypercube. Thus
in picking out the critical (or rejection) region in this space we can get no
assistance whatsoever from the changes in probability density, as we might do in
the x-space. If we wish to use a level of significance of $\alpha$ (say $\alpha = 0.01$) for rejecting
$h_0$, it is clear that an infinite number of critical regions satisfying this condition
are available; it is only necessary to select a region whose content is $\alpha$.

If we consider the n values of y and plot them in the interval (0, 1) as follows,

[Fig. 1. The n values of y plotted as points along the interval (0, 1).]

the great majority of samples, from a rectangular distribution, at any rate if n
is not too small, will be spread out fairly uniformly throughout the interval.
Perhaps an 'ideal' sample against which to measure irregularities might be
described as one for which the values of y fell at

$$\frac{1}{2n},\ \frac{3}{2n},\ \frac{5}{2n},\ \ldots,\ \frac{2n-1}{2n}.$$

But what form of departure from this ideal of uniformity are we to pick out as
suggesting that the hypothesis $h_0$ is disproved? Should we judge significance by
paying attention to the value of the mean y, of the variance, of the range of
variation or of higher moments? Or should we use the $\chi^2$ or $\omega^2$ tests? It seems
difficult to find any basis for choice which could be regarded in any sense as the
'best'. For any set of values $y_1, y_2, \ldots, y_n$ some critical region of size $\alpha$ can always
be found which will contain the sample point and therefore lead to the rejection
of $h_0$. Indeed, the task of selecting a unique region on any rational basis would
seem to be insoluble.

* This condition is important. If the values of certain constants contained in the probability
law need to be estimated from the observations, then the n values of y will not form a true random
sample from a rectangular distribution. They will be subject to certain limitations to their degrees
of freedom, though these may be relatively unimportant if n is large.











314 Notes on testing statistical hypotheses 


5. Directly it is recognized, however, that the choice of a test of a statistical
hypothesis depends on something more than the form of the law associated with
that hypothesis, it can be seen how a solution may be obtained. If we can specify
a single alternative $H_1$ to $H_0$ or a class of alternatives $C(H)$, then we shall have
also an alternative $h_1$ or a class $C(h)$ to $h_0$. Thus, if $p(x \mid H_1)$ denotes a probability
law alternative to $p(x \mid H_0)$, then for y the alternative is

$$p(y \mid h_1) = p(x \mid H_1)\,\frac{dx}{dy} = \left[\frac{p(x \mid H_1)}{p(x \mid H_0)}\right]_{x = f(y)} \quad \text{for } 0 < y < 1, \qquad (5)$$

where $f(y)$ means the solution of

$$y = \int_{-\infty}^{x} p(x \mid H_0)\,dx \qquad (6)$$

with regard to x. For example, Fig. 2 shows three typical forms of alternative
$p(y \mid h_1)$, $p(y \mid h_2)$ and $p(y \mid h_3)$ associated with alternatives $p(x \mid H_1)$, ..., etc., to
$p(x \mid H_0)$.








[Fig. 2. Three typical alternatives in the x-space and the corresponding curves $p(y \mid h_1)$, $p(y \mid h_2)$, $p(y \mid h_3)$ on the interval (0, 1). Solid curve represents $p(x \mid H_0)$; solid rectangle represents $p(y \mid h_0)$.]



We can now see the kind of test which will be most efficient for testing $H_0$
with regard to possible classes of alternatives. If the alternative laws are of
smaller dispersion (as $p(x \mid H_1)$), we must be on the look-out for too many values
of y near $\tfrac{1}{2}$ and too few near 0 and 1. For alternatives with greater dispersion
(as $p(x \mid H_2)$), we must reject $H_0$ when there are too many y's near 0 or 1 and too
few near $\tfrac{1}{2}$. If, on the other hand, the alternatives are likely to be asymmetrical curves (as
$p(x \mid H_3)$), then a different rule will be needed, as suggested by the $p(y \mid h_3)$ curve.
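
The behaviour described above can be checked numerically from equation (5). The following is only a sketch: it assumes, purely for illustration, that the hypothesis tested is the standard normal law and that the two alternatives are normal laws with standard deviations 0.5 and 2; the function name `alt_density_y` is hypothetical and not part of the original note.

```python
from statistics import NormalDist

def alt_density_y(y, h0, h1):
    """p(y | h1) = p(x | H1) / p(x | H0) at x = f(y), where f inverts the H0 integral (6)."""
    x = h0.inv_cdf(y)              # f(y): the x whose probability integral under H0 equals y
    return h1.pdf(x) / h0.pdf(x)

H0 = NormalDist(0.0, 1.0)          # assumed hypothesis tested
H_NARROW = NormalDist(0.0, 0.5)    # assumed alternative of smaller dispersion
H_WIDE = NormalDist(0.0, 2.0)      # assumed alternative of greater dispersion

for y in (0.05, 0.5, 0.95):
    print(y,
          round(alt_density_y(y, H0, H_NARROW), 3),
          round(alt_density_y(y, H0, H_WIDE), 3))
```

Under these assumptions the narrow alternative piles density near y = 1/2 and the wide alternative near 0 and 1, which is the pattern the text says the test should look for.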

6. It follows that in so far as it is possible to formulate the class of admissible
probability laws $p(x \mid H)$, the problem of selecting the most efficient test of $H_0$
reduces to that of choosing a critical region in the n-dimensioned hypercube
which is most effective in detecting, from a sample of n values of y, differences
between the rectangle $p(y \mid h_0)$ and the appropriate alternative forms $p(y \mid h)$.















E. S. PEARSON 315 


If $H_1$ is a single admissible alternative, then it has been shown (Neyman &
Pearson, 1933, p. 298) that the region $w_0$ of content $\alpha$ in the hypercube, within
which

$$\frac{\prod_{i=1}^{n} p(y_i \mid h_0)}{\prod_{i=1}^{n} p(y_i \mid h_1)} \le k, \qquad (7)$$

or, in view of equation (3),

$$\prod_{i=1}^{n} p(y_i \mid h_1) \ge \frac{1}{k}, \qquad (8)$$

where k is chosen so that

$$P\{(y_1, y_2, \ldots, y_n) \in w_0 \mid h_0\} = \alpha, \qquad (9)$$

has the following property.

Of all regions of content $\alpha$, $w_0$ is more likely than any other to include the
sample point when $h_1$, and not $h_0$, is true. The region has been termed the best
critical region for testing $h_0$ with regard to the alternative $h_1$.

As soon as $H_0$ and $H_1$ are specified, clearly $p(y \mid h_1)$ and therefore the region
$w_0$ can be found, although mathematically it may be rather difficult to determine
the appropriate boundary $\prod_{i=1}^{n} p(y_i \mid h_1) = \text{constant}$, so as to satisfy (8). Since this
product is the probability density in the hypercube given by $h_1$, it will be seen
that what we set out to do is to include in the critical region those parts of the
sample space where the density for $h_1$ is highest. It is here, on repeated sampling,
that sample points would tend to be concentrated if $h_1$ is true, instead of being
uniformly distributed as under $h_0$.

7. If instead of a single alternative $h_1$, there is a class of admissible
alternatives $C(h)$, there may or may not be common points of concentration that can
be included in the critical region. This will depend on whether the inequality
(8) above defines a region independent of the particular hypothesis h of the class
$C(h)$. Even if there is no single region of content $\alpha$ which is exactly a ‘best
critical region’ for $h_0$ with regard to all members of $C(h)$, the general principle
may still be used as a guide. We build up a critical region out of those parts of
the hypercube where the probability density tends to be concentrated when the
probability law departs from $p(x \mid H_0)$ in the direction of the alternatives included
in $C(H)$.

For example, in my earlier paper (Pearson, 1938) I suggested as appropriate
in the following situation a test which, while not based on a common best critical
region, was selected so as to include regions of greatest density associated with
alternatives of $C(h)$. For the hypothesis tested,

$$p(x \mid H_0) = \frac{1}{\sqrt{2\pi}}\, e^{-\frac{1}{2}x^2}. \qquad (10)$$

The alternatives are asymmetrical curves with the same mean and standard
deviation as (10). A typical alternative would be the Type III curve

$$p(x \mid H) = c\left(1 + \tfrac{1}{2}x\sqrt{\beta_1}\right)^{4/\beta_1 - 1} e^{-2x/\sqrt{\beta_1}}, \qquad (11)$$















whose form departs more and more from (10) as $\sqrt{\beta_1}$ increases from zero, but the
class need not be defined as precisely as this. In this problem it appears that
if n independent observations $x_1, x_2, \ldots, x_n$ are available, the following is a good
test of $H_0$. Take as test function

$$Q = \prod_{i=1}^{n} y'_i, \qquad (12)$$

where

$$y'_i = 5(0.2 - y_i) \ \text{for } 0 < y_i < 0.2, \qquad y'_i = \tfrac{5}{3}(y_i - 0.2) \ \text{for } 0.2 < y_i < 0.8, \qquad y'_i = 5(1 - y_i) \ \text{for } 0.8 < y_i < 1, \qquad (13)$$

and

$$y_i = \int_{-\infty}^{x_i} \frac{1}{\sqrt{2\pi}}\, e^{-\frac{1}{2}x^2}\,dx. \qquad (14)$$

If $H_0$ is true it may be shown that $-2\log_e Q$ is distributed as $\chi^2$ with 2n
degrees of freedom. Hence any desired significance level $\alpha$, for Q, may be found.
We should then reject $H_0$ when Q is significantly small.
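
As a rough illustration of the test function (12) to (14), the sketch below computes $-2\log_e Q$ for a sample and refers it to $\chi^2$ with 2n degrees of freedom. The middle branch of the mapping is scaled by 5/3 so that each branch carries its sub-interval linearly onto (0, 1), which is what the $\chi^2$ result requires. The function names, the seed and the simulated sample are assumptions made for the example only.

```python
import math
import random
from statistics import NormalDist

def fold(y):
    """Map y in (0, 1) to y'; each branch is linear onto (0, 1)."""
    if y < 0.2:
        return 5.0 * (0.2 - y)
    if y < 0.8:
        return (5.0 / 3.0) * (y - 0.2)
    return 5.0 * (1.0 - y)

def q_statistic(xs):
    """Return -2 ln Q for observations xs, taking H0 to be the standard normal law."""
    phi = NormalDist().cdf                     # probability integral (14)
    return -2.0 * sum(math.log(fold(phi(x))) for x in xs)

def chi2_upper_tail_even(x, df):
    """P(chi-square >= x) in closed form; valid only for an even number of degrees of freedom."""
    half = x / 2.0
    return math.exp(-half) * sum(half ** j / math.factorial(j) for j in range(df // 2))

random.seed(1)                                 # arbitrary seed, illustration only
xs = [random.gauss(0.0, 1.0) for _ in range(20)]
stat = q_statistic(xs)
# Q significantly small is the same event as -2 ln Q significantly large.
p_value = chi2_upper_tail_even(stat, 2 * len(xs))
print(round(stat, 2), round(p_value, 3))
```

The closed-form tail sum is used only because the number of degrees of freedom, 2n, is always even here.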

A more systematic method of dealing with such problems has been con- 
sidered by Neyman (1937) in his paper on ‘smooth tests’. 

8. To sum up, the position seems to be this. It has often been argued that a
statistical test need only depend on the form of the probability law associated
with the hypothesis tested. In the case where $H_0$ concerns the probability law
of a single random variable and where $p(x \mid H_0)$ is precisely specified, by the
transformation from x to y it has been shown that the problem of testing $H_0$ on the
basis of n independent values of x can always be reduced to another problem,
which involves this question: can we regard a sample $y_1, y_2, \ldots, y_n$ as having
been drawn from the rectangular distribution $p(y \mid h_0) = 1$, where $0 < y < 1$? We
are faced with a single fundamental question and we have to consider whether
it can be answered in a rational manner, unless we are prepared to take into account
the kind of departures from the rectangular law that we either believe possible
or at any rate consider it most important to be on the look out for.

The transformation from x to y seems to have the advantage that it con- 
centrates attention on the main point at issue. That is my reason for emphasizing 
it in these Notes. Most of us have many preconceived ideas about appropriate 
tests if the probability law is taken in the form of p(x | Hy); we are accustomed to 
use the mean, the standard deviation, certain functions of moments, the $\chi^2$ test, ....
But we are not so accustomed to test whether a sample comes from a rectangular 
distribution and we are therefore forced or, indeed, more willing to reconsider 
from first principles what course we should follow and why. 


REFERENCES 


NEYMAN, J. (1937). Skand. Actuar. Tidskr. 149. 
NEYMAN, J. & PEARSON, E. S. (1933). Philos. Trans. A, 231, 289.
PEARSON, E. S. (1938). Biometrika, 30, 134. 

















SIMPLE TESTS FOR GIVEN HYPOTHESES 
By E. J. GUMBEL 
New School for Social Research, New York 
CONTENTS

1. Control curves and the probability integral transformation
2. Classical tests applied to a uniform distribution
3. The mth point test


In dealing with the problem of testing statistical hypotheses J. Neyman (1937)
and E. S. Pearson (1938) have considered the use of the probability integral
transformation, which leads to a theoretical uniform distribution. This method
presupposes that the usual comparison between theory and observations has
already been applied. We shall first improve this comparison by introducing
control curves. Then we shall apply to the uniform distribution the usual methods
and the control curves. This will lead to simple tests for given statistical
hypotheses.


1. CONTROL CURVES AND THE PROBABILITY INTEGRAL TRANSFORMATION 


Let x be a continuous random variable for which n observations have been
made. Let $x_m$ be the observed values arranged in increasing order of magnitude,
m (= 1, 2, ..., n) being the serial number. The simplest way of representing the
observations is to plot the cumulative histogram $x_m$, m. The relative number
$W^{0}(x_m)$ of observations less than or equal to $x_m$ is given by

$$m = n\,W^{0}(x_m).$$

The consecutive differences

$$n\,W^{0}(x_m) - n\,W^{0}(x_l) \qquad (l < m)$$

constitute the observed distribution. Many statisticians present, instead of the
original observations $x_m$, only the number of cases within certain arbitrary classes.
From the practical standpoint this means a simplification, from the theoretical
standpoint a complication. We shall suppose that all $x_m$ are known.

The choice of a probability density to be applied to the observations
constitutes the hypothesis. The probability density $w(x, c_1, c_2, \ldots)$, where $c_1, c_2, \ldots$
are the constants, is called the theoretical distribution. For sake of simplicity it
is assumed that all observed values $x_m$ have the same theoretical distribution.
The probability $W(x, c_1, c_2, \ldots)$ of a value equal to or less than x is given by

$$W(x, c_1, c_2, \ldots) = \int_{-\infty}^{x} w(z, c_1, c_2, \ldots)\,dz, \qquad (1)$$







where z is the variable of integration. It is customary to compare the observations
$x_m$, m with the cumulative frequency curve x, nW(x). This comparison can be
improved in the following way: The mth observation is a statistical variable
distributed according to

$$w_m(x, n) = \binom{n}{m}\, m\, W(x)^{m-1}\,(1 - W(x))^{n-m}\, w(x). \qquad (2)$$

In a previous article (Gumbel, 1935) it has been shown that, for an ordinary
unlimited distribution, with large n and with m of the size $\tfrac{1}{2}n$, the distribution
(2) converges towards a normal distribution with a mean given by W(x) = m/n
and a standard deviation

$$\sigma = \frac{1}{w(x)}\sqrt{\frac{W(x)\,(1 - W(x))}{n}}. \qquad (3)$$

This formula does not contain m explicitly. Since each theoretical value x can
be interpreted as an mth value, (3) gives its standard deviation. The interval
$x \pm \sigma$ will be called the control interval. Under the above condition, the probability
that an mth value will fall within the control interval is about $\tfrac{2}{3}$. The two curves
obtained by plotting $x \pm \sigma$, nW(x) will be called control curves.
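
A short sketch of how the control interval of formula (3) might be evaluated is given here. It assumes, for illustration only, a standard normal theoretical distribution and n = 100; the helper name `control_interval` is hypothetical.

```python
import math
from statistics import NormalDist

def control_interval(x, dist, n):
    """Return (x - sigma, x + sigma), with sigma taken from the large-sample formula (3)."""
    W = dist.cdf(x)                                    # probability W(x)
    sigma = math.sqrt(W * (1.0 - W) / n) / dist.pdf(x)  # density w(x) in the denominator
    return x - sigma, x + sigma

theory = NormalDist(0.0, 1.0)   # assumed theoretical distribution
n = 100                         # assumed number of observations

for x in (-1.0, 0.0, 1.0):
    lo, hi = control_interval(x, theory, n)
    # The control curves pass through (lo, n*W(x)) and (hi, n*W(x)).
    print(x, round(lo, 3), round(hi, 3), round(n * theory.cdf(x), 1))
```

Plotting the two columns `lo` and `hi` against nW(x) would give the pair of control curves on either side of the cumulative frequency curve.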

For a given initial distribution we shall have to find the mean and the standard
deviation of the mth value which may differ from those of the general solution,
especially if n is small. The control interval will be a certain function of w(x).
Also the probability associated with the interval $x \pm \sigma$ may differ from that of the
general solution and may depend upon x. For the exponential distribution
(Gumbel, 1937) the precision diminishes with increasing values of the variable.
Below we shall apply this control to the uniform distribution w(x) = constant.

The calculation of the probability W(x) and the control curves can often be
simplified by an indirect method. For certain, but not all, distributions it is
possible to eliminate the constants by introducing a new variable y as a function
of x, where

$$x = f(y, c_1, c_2, \ldots). \qquad (4)$$

Accordingly, the probability that a value of the transformed variable will fall
in the interval y to y+dy is

$$w(x)\,dx = w[f(y)]\,|f'(y)|\,dy.$$

We call

$$p(y) = w[f(y)]\,|f'(y)| \quad \text{or} \quad p(y) = w(x)\left|\frac{dx}{dy}\right| \qquad (5)$$

the distribution of y. The probability V(y) of a value equal to or less than y is

$$V(y) = W(x, c_1, c_2, \ldots). \qquad (6)$$


The transformation (4) is chosen such that the expression V(y) does not contain 
any constants which depend upon the observations. Therefore V(y) can be 



















calculated once and for all as a function of y. Such tables have been calculated 
for the distributions for which this reduction is possible. 

In order to compare the cumulative histogram $x_m$, m with the cumulative
frequency x, nW(x), and to use the control curves, it is first necessary to compute
the constants $c_1, c_2, \ldots$. If the method of moments is used, the area between the
cumulative histogram and the horizontal line W = 1, the arithmetic mean, is
conserved. The value of the variable x which corresponds to a selected, numerical
value of W(x) is obtained from the transformation (4).

A special case of the transformation (4) which leads to a reduced distribution
of astounding simplicity is the probability integral transformation due to Karl
Pearson (1933) who introduced for y the probability function W(x). Since

$$\frac{dx}{dW} = \frac{1}{w(x)}, \qquad (7)$$

we obtain from (5)

$$p(W) = 1. \qquad (8)$$


This identity means that the distribution of the probability is constant. As 
p(W) is the probability density of a probability, it is difficult to establish its 
philosophical meaning. But formally the construction is valid and the corre- 
sponding value can be observed. It is our purpose to give several methods of 
judging the significance of the differences between the theoretical distribution (8) 
and the corresponding ‘observed’ distribution. 
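
The identity (8) can be illustrated by simulation. The sketch below assumes, only for the sake of the example, an exponential hypothesis whose single constant (the rate) is known exactly; the seed, the sample size and the helper name are likewise assumptions.

```python
import math
import random

def exponential_cdf(x, rate):
    """W(x) for the exponential distribution with the given rate."""
    return 1.0 - math.exp(-rate * x)

random.seed(7)                  # arbitrary seed, illustration only
rate = 2.0                      # assumed known constant of the hypothesis
xs = sorted(random.expovariate(rate) for _ in range(2000))
ws = [exponential_cdf(x, rate) for x in xs]   # the transformed 'observations'

# When the hypothesis is correct the W values should look uniform on (0, 1).
mean_w = sum(ws) / len(ws)
share_below_quarter = sum(w < 0.25 for w in ws) / len(ws)
print(round(mean_w, 3), round(share_below_quarter, 3))
```

With a wrong rate, or a wrong analytic form, the same two summaries would drift away from 1/2 and 1/4, which is what the tests that follow are meant to detect.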

The word ‘observation’ will be given a special meaning. A certain theory $h_0$
which involves the choice of certain constants $c_1, c_2, \ldots$ applied to the
observations, leads to the values

$$W_0 = W(x_m, c_1, c_2, \ldots \mid h_0) \qquad (m = 1, 2, \ldots, n),$$

corresponding to $x_m$. These values, contained in the interval 0, 1, are the
‘observations’. To any other set of constants $c'_1, c'_2, \ldots$ will correspond other ‘observed’
values

$$W'_0 = W(x_m, c'_1, c'_2, \ldots \mid h_0).$$

Therefore any test applied to the ‘observations’ of formula (8) might be used to
judge the choice of the constants. To another hypothesis $h_1$ containing the
constants $d_1, d_2, \ldots$, will correspond another set of ‘observed’ points

$$W_1 = W(x_m, d_1, d_2, \ldots \mid h_1).$$


The same observations when interpreted by different theories or different 
constants lead to different ‘observations’. An incorrect theory involving properly
chosen constants might give better results than a correct theory involving im- 
properly chosen constants. Therefore, to compare different theories, the constants 
for each must be determined with the same precision. In practice this condition 
will not always be fulfilled. For the precision of the characteristics depends upon 
the distribution for which they are calculated (Gumbel, 1936). Therefore, the 










same method of determining the constants might lead to different degrees of 
precision for different distributions, whereas different determinations of the 
constants might lead to approximately the same precision. 

There is another point for caution: all tests derived from the probability 
integral transformation apply to the analytic form of the hypotheses and at the 
same time to the choice of the constants. A formula containing many constants 
may reproduce the observations more closely than a formula containing few 
constants, even though the constants in the first hypothesis have no meaning. 
Therefore we must limit our comparison to hypotheses containing the same 
number of constants. For any set of statistical observations in the ordinary sense, 
there will usually correspond a small number of tenable hypotheses. We shall 
suppose it is known what they are. For we do not try to find a formula for the 
sake of doing it, but to explain the observed facts. We will not go so far as Neyman 
(1937), who formulated all possible alternatives by a series of orthogonal 
functions. 

In theory the points representing $W(x_m)$ are distributed uniformly in the
interval 0, 1. This is true for any hypothesis, provided the variable is continuous. 
But in practice this will never occur. The ‘observed’ points corresponding to any 
given hypothesis will differ from the theoretical set, even if the hypothesis is a 
very good one. The differences between the ‘observed’ set of points resulting 
from $h_0$ and $h_1$ and the theoretical set allow the construction of tests which
can be used to judge which of two given hypotheses is the better. But no 
statistical method gives an answer to the question whether or not a hypothesis 
is true. 

After a hypothesis has been selected, the preliminary steps which have to be 
made before it is possible to use the probability integral transformation, are: 
first, the determination of the constants; secondly, the calculation of probabilities 
W(x) for the values of x given by the transformation (4); and thirdly, the
calculation of the probabilities $W(x_m)$ of the observed values. It is only after these three
operations have been carried out that we obtain the ‘observations’ which are 
to be compared with the theory (8). Therefore, any test based on the probability 
integral transformation presupposes the usual comparison of the observed 
cumulative histogram with the frequency curve. In many cases this comparison, 
checked by the control curves, will indicate a clear superiority of one of the 
theories. If this is true, there is no necessity for a new test. 

It would be interesting to investigate the best criterion for judging the 
significance of the differences between the ‘observed’ and the uniform distribu- 
tion (8). But for practical purposes it is sufficient to know whether the differences 
for $h_0$ are smaller or larger than for $h_1$. First we shall establish rough measures of
comparison; afterwards, more refined ones. 






















2. CLASSICAL TESTS APPLIED TO A UNIFORM DISTRIBUTION 


The comparison of an observed distribution of a continuous variable with the
theoretical distribution is reduced by the probability integral transformation to
a comparison of ‘observed’ points with a uniform distribution. It seems logical
to use first the classical methods which are here very simple, as no constants have
to be determined. For a uniform distribution

$$p(y) = 1 \qquad (0 \le y \le 1),$$

the arithmetic mean and the median are, respectively,

$$\bar{y} = \breve{y} = \tfrac{1}{2}. \qquad (9)$$

The mean error $\vartheta$ and the probable error $\rho$, defined as half of the difference between
the two quartiles, are

$$\vartheta = \rho = \tfrac{1}{4}. \qquad (10)$$

The kth moment about the origin is

$$M_k = \frac{1}{k+1},$$

which gives the recurrence relation

$$\frac{1}{M_{k+1}} - \frac{1}{M_k} = 1. \qquad (11)$$

Since the distribution is symmetrical, the odd moments about the arithmetic
mean vanish. Therefore

$$\beta_1 = 0.$$

The even moments are

$$\mu_{2k} = \int_0^1 \left(y - \tfrac{1}{2}\right)^{2k} dy = \frac{1}{2^{2k}(2k+1)}. \qquad (12)$$

Therefore the standard deviation, the coefficient of variation and the second
beta are, respectively,

$$\sigma = \frac{1}{2\sqrt{3}}, \qquad (13)$$

$$v = \frac{1}{\sqrt{3}}, \qquad (14)$$

and

$$\beta_2 = \frac{9}{5}. \qquad (15)$$
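
The constants (11) to (15) follow from the moments $M_k = 1/(k+1)$ by exact arithmetic, as the following sketch shows. The function names are illustrative and not taken from the paper.

```python
from fractions import Fraction
from math import comb

def raw_moment(k):
    """M_k = 1/(k+1): kth moment of the uniform distribution about the origin."""
    return Fraction(1, k + 1)

def central_moment(k):
    """kth moment about the mean 1/2, expanded binomially from the raw moments."""
    return sum(comb(k, j) * raw_moment(j) * Fraction(-1, 2) ** (k - j)
               for j in range(k + 1))

mu2, mu3, mu4 = central_moment(2), central_moment(3), central_moment(4)
beta1 = mu3 ** 2 / mu2 ** 3
beta2 = mu4 / mu2 ** 2
print(mu2, mu4, beta1, beta2)   # prints 1/12 1/80 0 9/5
```

The variance 1/12 gives the standard deviation $1/(2\sqrt{3})$ of (13), and dividing by the mean 1/2 gives the coefficient of variation $1/\sqrt{3}$ of (14).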


It is necessary now to calculate the ‘observed’ means, the measures of
dispersion, and the relations between successive moments. To control the
agreement between the theoretical uniform distribution and the ‘observed’ points
we can still employ the standard error $\sigma_{\bar{y}}$ of the arithmetic mean. The general
formula

$$\sigma_{\bar{y}} = \frac{\sigma}{\sqrt{n}}$$

becomes according to (13)

$$\sigma_{\bar{y}} = \frac{1}{2\sqrt{3n}}. \qquad (16)$$

The standard error of the dispersion for n large is $\sigma(\sigma^2) = \sqrt{(\mu_4 - \mu_2^2)/n}$, which
becomes, according to (12),

$$\sigma(\sigma^2) = \sqrt{\frac{1}{n}\left(\frac{1}{80} - \frac{1}{144}\right)} = \frac{1}{6\sqrt{5n}}. \qquad (17)$$
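
The two standard errors (16) and (17) lend themselves to a quick check of a set of ‘observed’ points. The sketch below expresses the deviations of the observed mean and variance from 1/2 and 1/12 in units of those standard errors. The function name and the nine equidistant points are hypothetical, chosen only to have something to run.

```python
import math

def classical_checks(ys):
    """Deviations of the observed mean and variance of ys from their uniform
    expectations, each in units of its standard error, (16) and (17)."""
    n = len(ys)
    mean = sum(ys) / n
    var = sum((y - mean) ** 2 for y in ys) / n
    se_mean = 1.0 / (2.0 * math.sqrt(3.0 * n))     # formula (16)
    se_var = 1.0 / (6.0 * math.sqrt(5.0 * n))      # formula (17), large-n approximation
    return (mean - 0.5) / se_mean, (var - 1.0 / 12.0) / se_var

# Hypothetical 'observed' points: the equidistant set m/(n+1) for n = 9.
ys = [m / 10 for m in range(1, 10)]
z_mean, z_var = classical_checks(ys)
print(round(z_mean, 3), round(z_var, 3))
```

Formula (17) is stated for large n, so for a sample as small as this one the second ratio is at best a rough guide.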


It seems reasonable to employ these old-fashioned tests before the use of 
more sophisticated methods is resorted to. Only if they fail would it be necessary 
to consider more elaborate methods. 

The n ‘observed’ points $W(x_m)$ represent the probabilities, obtained from the
hypothesis $h_0$, of the given observations in the ordinary sense, $x_m$. We plot these
points in the interval 0, 1 which is divided into k cells of equal length, where k 
is chosen in such a way that n/k is an integer. If n is a multiple of 10, we choose 
the cells (0-0, 0-1), (0-1, 0-2), ..., (0-9, 1-0). 

The probability density of a point falling somewhere within the interval 0, 1
is constant. As the interval is of length 1, this density is 1. Therefore, the
probability of a point lying within a given cell is 1/k, and the expected number of
points in each cell is n/k. The ‘observed’ number of points obtained through a
hypothesis $h_0$ will be $a_\nu$ ($\nu = 1, 2, \ldots, k$). If we apply another hypothesis $h_1$ to the
same observations or introduce other numerical values for the constants, the new
set of ‘observed’ points will lead to values $b_\nu$ which, in general, will differ from $a_\nu$.

The classical statistical method of treating this material is the $\chi^2$ test. As the
numbers $a_\nu$, $b_\nu$ will differ from the expected number n/k, we can calculate for both
hypotheses

$$\chi_0^2 = \frac{k}{n}\sum_{\nu=1}^{k}\left(a_\nu - \frac{n}{k}\right)^2, \qquad \chi_1^2 = \frac{k}{n}\sum_{\nu=1}^{k}\left(b_\nu - \frac{n}{k}\right)^2. \qquad (18)$$

The better hypothesis will have a lower value of $\chi^2$ and a greater value P, where
P denotes the probability of obtaining the ‘observed’ deviations from uniform
distribution or larger ones. The probability P depends upon the number of cells
chosen. Therefore, to compare two competing hypotheses, the same division must
be used.
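
A minimal sketch of the calculation in (18) follows. The two lists of transformed values are invented for illustration, as are the function names; only the formula itself comes from the text.

```python
def cell_counts(ws, k):
    """Count how many of the transformed values ws fall in each of k equal cells."""
    counts = [0] * k
    for w in ws:
        counts[min(int(w * k), k - 1)] += 1   # a value of exactly 1.0 goes to the last cell
    return counts

def chi_square(counts):
    """chi^2 = (k/n) * sum((a_v - n/k)^2), as in (18)."""
    k, n = len(counts), sum(counts)
    expected = n / k
    return sum((a - expected) ** 2 for a in counts) / expected

# Hypothetical 'observed' points from two competing hypotheses h0 and h1.
w0 = [0.05, 0.15, 0.22, 0.38, 0.41, 0.57, 0.63, 0.79, 0.84, 0.96]
w1 = [0.02, 0.04, 0.11, 0.13, 0.18, 0.21, 0.27, 0.55, 0.61, 0.93]
print(chi_square(cell_counts(w0, 10)), chi_square(cell_counts(w1, 10)))
```

Both hypotheses are judged on the same division into ten cells, as the text requires; here the first set gives the smaller value and would be preferred.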

The application of the $\chi^2$ test to the ‘observations’ $W(x_m)$ eliminates an
arbitrary action which is a serious and well-known drawback of the $\chi^2$ test, when
applied to the original observations $x_m$. The expected contents of the classes
depend upon the distribution. Therefore certain classes, as a rule the first and the
last, must be chosen such that the expected number is not too small, otherwise
$\chi^2$ becomes very large. In our case, no cell differs from any other and no arbitrary
combination of cells is needed. We can choose k = n. The mean number of points
in each cell will then be one and

$$\chi_0^2 = \sum_{\nu=1}^{n} (a_\nu - 1)^2, \qquad \chi_1^2 = \sum_{\nu=1}^{n} (b_\nu - 1)^2. \qquad (18')$$

This choice removes another drawback of the $\chi^2$ method: different classifications
used for the same observations lead to different shapes of the distribution and
therefore to different values of $\chi^2$. Here the classification is prescribed once and
for all.

Another comparison between theory and ‘observation’ may be based on the
fact that different sets of points $a_\nu$ and $b_\nu$ have different probabilities. The
probability that $a_\nu$ points will fall within the cell $\nu$ (= 1, 2, ..., k) is

$$\frac{n!}{a_1!\,a_2! \cdots a_k!}\left(\frac{1}{k}\right)^{a_1 + a_2 + \cdots + a_k},$$

where $a_1 + a_2 + \cdots + a_k = n$.
Since the factor $n!\,k^{-n}$ is constant, it is sufficient to investigate

$$\Pi = \frac{1}{\prod_{\nu=1}^{k} a_\nu!}. \qquad (19)$$

Of course $\Pi < P$, as the latter probability applies to the ‘observed’ deviation or
larger ones. The statement ‘The probability for $a_\nu$ points to be contained in the
cell $\nu$ is proportional to $\Pi$’ may be inverted according to Bayes’s principle.
Therefore, $\Pi$ is proportional to the probability that the distribution of points
is rectangular, i.e. that $h_0$ is a good hypothesis.

The question for which set $a_\nu$ the probability $\Pi$ is maximum, is the
starting-point of the classical relation between entropy and probability. For large n the
most probable set of points is the one which has the same number of points in
each cell, i.e.

$$a_1 = a_2 = \cdots = a_k = \frac{n}{k}. \qquad (20)$$

Let us call $\Pi_{\max}$ the probability which corresponds to this distribution. The
probability of the hypothesis $h_0$ will be greater, equal to, or less than the
probability of $h_1$, if

$$\frac{\Pi_0}{\Pi_{\max}} \gtreqqless \frac{\Pi_1}{\Pi_{\max}}. \qquad (21)$$

The relative probability of both hypotheses will be $\Pi_1/\Pi_0$ or $\Pi_0/\Pi_1$, depending
on whether

$$\Pi_0 \gtrless \Pi_1.$$

As these probabilities depend on the number of cells, the same division must be
used to compare two competing hypotheses. We can choose k = n. Then $\Pi_{\max} = 1$
and we can use $\Pi_0$ as test.

The entropy test (21) is closely related to the $\chi^2$ test (18). This classical
relation can be obtained in the following simple way: if q is the constant
probability of a point falling within a given cell, then for n observations the expected
number of points within a cell is nq. But the ‘observed’ numbers $a_\nu$ will differ
from the expected number by $\epsilon_\nu$, so that

$$a_\nu = nq + \epsilon_\nu, \qquad \text{where } \sum_{\nu=1}^{k} \epsilon_\nu = 0.$$

The quotient (21) becomes

$$\frac{\Pi_0}{\Pi_{\max}} = \prod_{\nu=1}^{k} \frac{(nq)!}{(nq + \epsilon_\nu)!}.$$

When n is large, each factor becomes, by application of Stirling’s formula,

$$\frac{(nq)!}{(nq + \epsilon_\nu)!} = \left(\frac{e}{nq}\right)^{\epsilon_\nu} \exp\left[-\left(nq + \epsilon_\nu + \tfrac{1}{2}\right)\ln\left(1 + \frac{\epsilon_\nu}{nq}\right)\right].$$

Expansion of the logarithm leads to

$$\frac{(nq)!}{(nq + \epsilon_\nu)!} = \left(\frac{e}{nq}\right)^{\epsilon_\nu} \exp\left[-\epsilon_\nu - \frac{\epsilon_\nu^2}{2nq} - \frac{\epsilon_\nu}{2nq}\right].$$

According to the meaning of $\epsilon_\nu$ we obtain

$$\frac{\Pi_0}{\Pi_{\max}} = \exp\left[-\frac{1}{2}\sum_{\nu=1}^{k} \frac{(a_\nu - nq)^2}{nq}\right],$$

whence

$$\frac{\Pi_0}{\Pi_{\max}} = e^{-\frac{1}{2}\chi_0^2}. \qquad (22)$$

Therefore, when n is large, the entropy test becomes identical with the $\chi^2$ test.
This result was derived by Neyman & Pearson (1928), when they showed that
the $\chi^2$ test followed from their ‘likelihood ratio’ method of approach.
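
The limiting relation (22) can be checked numerically by comparing $-2\ln(\Pi_0/\Pi_{\max})$ with $\chi_0^2$. In the sketch below the large-sample cell counts are invented for the purpose and the function names are hypothetical; `math.lgamma` is used so that the factorials are never formed explicitly.

```python
import math

def chi_square(counts):
    """chi^2 of (18) for equal cells."""
    k, n = len(counts), sum(counts)
    expected = n / k
    return sum((a - expected) ** 2 for a in counts) / expected

def minus_two_log_ratio(counts):
    """-2 ln(Pi_0 / Pi_max), with Pi = 1 / prod(a_v!) and n/k points per cell at the maximum."""
    k, n = len(counts), sum(counts)
    per_cell = n // k                      # assumes n is a multiple of k
    log_pi_0 = -sum(math.lgamma(a + 1) for a in counts)
    log_pi_max = -k * math.lgamma(per_cell + 1)
    return -2.0 * (log_pi_0 - log_pi_max)

# Hypothetical counts: n = 10000 points in k = 10 cells, small deviations from 1000.
large = [1010, 990, 1005, 995, 1000, 1000, 1020, 980, 1003, 997]
# Set B of Table 1: n = 10 points in k = 10 cells.
small = [2, 3, 2, 1, 0, 1, 0, 1, 0, 0]

print(round(chi_square(large), 4), round(minus_two_log_ratio(large), 4))
print(round(chi_square(small), 4), round(minus_two_log_ratio(small), 4))
```

For the large sample the two numbers nearly coincide; for n = 10 they differ appreciably, which is a reminder that (22) is a large-n statement.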

Neither test will give an answer, if the number of points $a_\nu$ assigned by $h_0$ to
the cell $\nu$ is equal to the number of points $b_\lambda$ assigned by $h_1$ to the cell $\lambda$, $a_\nu = b_\lambda$,
where for any $\nu$ (= 1, 2, ..., k) it is possible to find a $\lambda$ (= 1, 2, ..., k), such that not
all $\lambda = \nu$. An example of this occurring is shown in Table 1, col. C and F. The
reason for the failure is that we do not make use of the actual position of the
observed points within the cells. We only ask in which cell they are situated.
Although in such a case the tests do not show any difference between the
arrangements $a_\nu$ and $b_\nu$, some conclusions might be drawn from such ‘observations’.
If the number of points falling in the first few cells and also in the last few cells
is disproportionately large, and if there is a deficiency in the middle cells (Table 1,
col. D), we have to conclude that the distribution $h_0$ is too concentrated or that
we have chosen too small a value for the constant which depends only on the
standard deviation. If the number of points in the cells at either end is small
(Table 1, col. E), the inverse inferences follow. These considerations may give a
hint about the choice of an alternative hypothesis.
















To illustrate the above methods, let us take the fictitious example given by
Pearson (1938) in his Fig. 2, p. 136. He arranges n = 10 points in k = 10 cells
and considers six sets A, B, ..., F, given in Table 1.

Let us suppose that these six sets are the results of six different hypotheses
applied to the same observations. The $\chi^2$ test leads to

$$P_A > P_D > P_C = P_F > P_B = P_E.$$

The probabilities of the various columns give the same ordering

$$1 = \Pi_A > \Pi_D > \Pi_C = \Pi_F > \Pi_B = \Pi_E.$$

The most probable set contains one point in each cell (set A). It is not possible
to decide whether C is more probable than F, and whether B is more probable
than E.


Table 1. Pearson’s set

Class        A      B      C      D      E      F
0.0-0.1      1      2      0      2      0      0
0.1-0.2      1      3      0      1      0      1
0.2-0.3      1      2      0      2      1      2
0.3-0.4      1      1      0      0      2      2
0.4-0.5      1      0      1      0      2      0
0.5-0.6      1      1      2      1      3      1
0.6-0.7      1      0      1      0      1      0
0.7-0.8      1      1      2      1      1      0
0.8-0.9      1      0      2      1      0      2
0.9-1.0      1      0      2      2      0      2

$\chi^2$     0      10     8      6      10     8
P            1      0.350  0.534  0.740  0.350  0.534
$\Pi$        1      1/24   1/16   1/8    1/24   1/16
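
The three summary rows of Table 1 can be recomputed from the cell counts, as sketched below. Two points are assumptions on my part rather than statements in the text: the tabulated P values are taken to be $\chi^2$ tail probabilities with k - 1 = 9 degrees of freedom (this is what reproduces the printed figures), and the tail is evaluated with the standard finite series for an odd number of degrees of freedom. All function names are illustrative.

```python
import math

SETS = {
    "A": [1, 1, 1, 1, 1, 1, 1, 1, 1, 1],
    "B": [2, 3, 2, 1, 0, 1, 0, 1, 0, 0],
    "C": [0, 0, 0, 0, 1, 2, 1, 2, 2, 2],
    "D": [2, 1, 2, 0, 0, 1, 0, 1, 1, 2],
    "E": [0, 0, 1, 2, 2, 3, 1, 1, 0, 0],
    "F": [0, 1, 2, 2, 0, 1, 0, 0, 2, 2],
}

def chi_square(counts):
    """chi^2 of (18); with k = n it reduces to sum((a_v - 1)^2)."""
    k, n = len(counts), sum(counts)
    expected = n / k
    return sum((a - expected) ** 2 for a in counts) / expected

def pi_reciprocal(counts):
    """1 / Pi of (19), i.e. the product of the factorials of the cell counts."""
    return math.prod(math.factorial(a) for a in counts)

def chi2_upper_tail_odd(x, df):
    """P(chi-square >= x) for an odd number of degrees of freedom (finite series)."""
    root = math.sqrt(x)
    tail = math.erfc(root / math.sqrt(2.0))              # twice the normal upper tail
    density = math.exp(-x / 2.0) / math.sqrt(2.0 * math.pi)
    term, total = root, 0.0
    for r in range(1, (df - 1) // 2 + 1):
        total += term
        term *= x / (2 * r + 1)                          # next odd factor 3, 5, 7, ...
    return tail + 2.0 * density * total

for name, counts in SETS.items():
    x2 = chi_square(counts)
    print(name, x2, round(chi2_upper_tail_odd(x2, 9), 3), f"1/{pi_reciprocal(counts)}")
```

Sets C and F, and likewise B and E, come out identical on every line, which is the indeterminacy the text points out.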


The $\chi^2$ and the entropy test are based upon the same data. But the results
reached are incomplete, as artificialities are introduced by the classification of 
the ‘observations’ into the arbitrary cells. The actual position of the points 
within the cells is not used. The set A shows that these tests may be misleading 
in still another way. Each cell in set A contains exactly the expected number. 
But it would be false to conclude that the hypothesis is true, since the actual 
positions of the points within the cells might differ from the ideal positions. 

Let us suppose we know these positions. It might then happen that the 
difference between the observed and the ideal positions of the points is smaller 
for a set K than for a set L, even if the differences between the actual and the 
theoretical number of points is larger for K than for L. 

It is now our task to assign a meaning to the term ideal position and to define 
a measure of the differences between the ‘observed’ and the ideal set. 







3. THE mTH POINT TEST 


The ideal position of n points, distributed with uniform probability over the
interval 0, 1, is such that the distances between consecutive pairs of points are
equal. But there are a number of ways of distributing n points equidistantly
over the interval 0, 1. E. S. Pearson, in the preceding note, suggests that

$$y_m = \frac{2m-1}{2n} \qquad (23)$$

might be used as the ideal position of the mth point. However, as y is a statistical
variable, we should represent it by an average, to choose which we must consider
the distribution of the mth point.* Any observation chosen at random has the
same probability of falling on any position y within the interval. But for the mth
point this probability depends upon y and m. The initial distribution w of the
variable W(x) = y is constant. According to (8)

$$w(y) = 1 \qquad (0 \le y \le 1).$$

The probability of obtaining a point equal to or less than y is y. According to
(2) the distribution of the mth point is

$$w_m(y, n) = \binom{n}{m}\, m\, y^{m-1}(1-y)^{n-m}. \qquad (24)$$

The distribution (24) is of Karl Pearson’s Type I. For m = 1 (and m = n) the
distribution will only decrease (increase). The distribution of the mth point is
equal to the distribution of the (n-m+1)th point if we replace y by 1-y:

$$w_{n-m+1}(1-y, n) = w_m(y, n). \qquad (24')$$

The most probable position $\tilde{y}_m$ of the mth point is given by

$$\frac{n-m}{1-\tilde{y}_m} = \frac{m-1}{\tilde{y}_m},$$

which leads to

$$\tilde{y}_m = \frac{m-1}{n-1}. \qquad (25)$$


For given values of m the median position can be obtained from the tables of
the Incomplete Beta Function. To find the arithmetic mean $\bar{y}_m$ and the control
curves, it is necessary to have the moments $M_k$ of (24). They are

$$M_k = \binom{n}{m}\, m \int_0^1 y^{m+k-1}(1-y)^{n-m}\,dy.$$

According to the well-known properties of the Gamma function

$$M_k = \frac{n!}{(m-1)!\,(n-m)!}\cdot\frac{(m+k-1)!\,(n-m)!}{(n+k)!} = \frac{n!}{(n+k)!}\cdot\frac{(m+k-1)!}{(m-1)!}.$$

* [The distribution of a ranked individual sampled from a rectangular population, and the
moments of this distribution, were obtained by Karl Pearson in the first of two papers (1931,
p. 390, and 1932) dealing with ranked variates. ED.]















Therefore

$$M_{k+1} = M_k\,\frac{m+k}{n+k+1}. \qquad (26)$$

For k = 1 the arithmetic mean of the mth point is

$$\bar{y}_m = \frac{m}{n+1}, \qquad (27)$$

and for k = 2 the second moment is

$$M_2 = \bar{y}_m\,\frac{m+1}{n+2}.$$

Finally, the variance

$$\sigma_m^2 = \frac{m(n-m+1)}{(n+1)^2(n+2)}. \qquad (28)$$

These formulae apply also to the cases m = 1 and m = n. The standard deviation
of the mth value may be written

$$\sigma_m = \sqrt{\frac{\bar{y}_m(1 - \bar{y}_m)}{n+2}}. \qquad (28')$$

This formula differs slightly from the general expression (3), and leads to an
unexpected result: as we approach the centre from either side, the precision of
the mth point decreases. The precision of the mth point will be a minimum for
$m = \tfrac{1}{2}n + 1$, if n is even, and for $m = \tfrac{1}{2}(n+1)$ if n is odd. The values of $\sigma_m\sqrt{n+2}$
are given in Table 2.


Table 2. Standard deviation of the mth point

$\bar{y}_m$    $\bar{y}_m$    $\sigma_m\sqrt{n+2}$
0.05           0.95           0.21794
0.10           0.90           0.30000
0.15           0.85           0.35707
0.20           0.80           0.40000
0.25           0.75           0.43301
0.30           0.70           0.45826
0.35           0.65           0.47697
0.40           0.60           0.48990
0.45           0.55           0.49749
0.50           0.50           0.50000
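
The mean (27), the variance (28) and the entries of Table 2 can be recovered from the general moment formula by exact arithmetic. The sketch below takes n = 19 purely as a convenient assumption, so that the means m/(n+1) fall on multiples of 0.05; the function names are illustrative.

```python
from fractions import Fraction
from math import factorial, sqrt

def moment(k, m, n):
    """M_k of the mth of n ordered uniform points: n!/(n+k)! * (m+k-1)!/(m-1)!."""
    return Fraction(factorial(n) * factorial(m + k - 1),
                    factorial(n + k) * factorial(m - 1))

def mean_and_variance(m, n):
    """Return the exact mean (27) and variance (28) of the mth point."""
    mean = moment(1, m, n)
    return mean, moment(2, m, n) - mean ** 2

n = 19                                   # assumed sample size, so that m/(n+1) = m/20
for m in (1, 5, 8, 10):
    mean, var = mean_and_variance(m, n)
    # Tabulated quantity: sigma_m * sqrt(n + 2) = sqrt(mean * (1 - mean)).
    print(m, mean, round(sqrt(var * (n + 2)), 5))
```

Because $\sigma_m\sqrt{n+2}$ depends on m and n only through $\bar{y}_m$, the same table serves for every sample size.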


We must now decide whether to use the mean (27) or the mode (25) as the
ideal position of the mth point. The modes of the first and of the last points are
0 and 1 respectively, whereas the corresponding means are

$$\bar{y}_1 = \frac{1}{n+1}, \qquad \bar{y}_n = \frac{n}{n+1}.$$

As the ‘observations’ $W_0(x_1)$ and $W_0(x_n)$ of the first and the last point differ
from 0 and 1, the arithmetic mean is to be preferred.










Formula (27) gives the ideal position and therefore the theoretical numbers 
of points in each cell which can be compared with the ‘observed’ numbers. This 
method leads to an improvement of the tests (18) and (21) where the choice of 
the cells was still arbitrary and where the actual position of each point was not 
taken into account. 

Besides comparing the uniform distribution with the ‘observed’ position of 
the points we can use the corresponding cumulative frequency. The probability 
scale y is plotted as abscissa and m as ordinate. We count the number of points 
below m. The mean position $\bar y$ of the mth point becomes a straight line differing
from the diagonal which represents the modal positions. The figure opposite 
traces, for n = 20, the mean, the modal and Pearson’s position of the mth point 
given by (23). 

In the same way we plot the ‘observed’ points obtained by $h_0, h_1, \ldots$. These
probability points $W_0(x_m)$, $W_1(x_m)$ will be scattered about the straight line. Usually
the area between the observed cumulative frequency curve, the ordinate and the
parallel to the abscissa is kept equal to the corresponding area for the theoretical
curve. Since for the present problem no constants have to be determined, we
have no way of enforcing this equality. The area J bounded by the diagonal
straight line through the points with the co-ordinates $m/(n+1)$, $m$ $(m = 1, 2, \ldots, n)$,
the length $1/(n+1)$ to $n/(n+1)$ of the abscissa axis and the two parallels to the
ordinate axis, is
$$J = \tfrac12(n-1). \tag{29}$$


This might differ from the area $J^{(0)}$ of the n − 1 ‘observed’ trapezes
$y_m$, $m$, $m+1$, $y_{m+1}$ $(m = 1, 2, \ldots, n-1)$.
As $y_m$ are the ‘observed’ points
$$J^{(0)} = \sum_{m=1}^{n-1} (m+\tfrac12)(y_{m+1} - y_m)$$
$$= \sum_{m=2}^{n} m\,y_m - \sum_{m=1}^{n-1} m\,y_m - \tfrac12\sum_{m=2}^{n} y_m - \tfrac12\sum_{m=1}^{n-1} y_m$$
$$= n\,y_n - \sum_{m=1}^{n} y_m + \tfrac12(y_n - y_1).$$
If we replace each value $y_m$ by its expectation from (27), we get
$$\overline{J^{(0)}} = \frac{1}{n+1}\{n^2 - \tfrac12[n(n+1)] + \tfrac12(n-1)\} = \frac{n^2-1}{2(n+1)} = \tfrac12(n-1), \tag{29'}$$
as it ought to be. The ‘observed’ area is not equal to the theoretical area, but its
expectation is. It might happen that the numerical value of $J^{(0)}$ is very close to J
as a result of compensating deviations. Therefore this numerical comparison can
be used as a test only in connexion with the graph of the ‘observed’ and theoretical
cumulative histogram of the mth points.
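The trapezoid sum and its closed form can be checked numerically. The Python sketch below is illustrative only (the function names are hypothetical); it evaluates the ‘observed’ area both as the sum over the n − 1 trapezes and as $n y_n - \sum y_m + \tfrac12(y_n - y_1)$, and confirms that the ideal positions $m/(n+1)$ give $\tfrac12(n-1)$.

```python
def observed_area(y):
    """Sum of (m + 1/2) * (y_{m+1} - y_m) over m = 1, ..., n - 1.

    `y` holds the ordered points y_1, ..., y_n (index 0 is y_1).
    """
    n = len(y)
    return sum((m + 0.5) * (y[m] - y[m - 1]) for m in range(1, n))


def observed_area_closed_form(y):
    """Equivalent expression n*y_n - sum(y) + (y_n - y_1)/2."""
    n = len(y)
    return n * y[-1] - sum(y) + 0.5 * (y[-1] - y[0])


n = 20
ideal = [m / (n + 1) for m in range(1, n + 1)]  # mean positions from (27)
print(observed_area(ideal), observed_area_closed_form(ideal), 0.5 * (n - 1))
```

Any other ordered set of probabilities can be passed in place of `ideal`; as the text notes, a value close to $\tfrac12(n-1)$ may still arise from compensating deviations.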







To control the agreement between the ‘observed’ cumulative histogram and
the ideal straight line we use two control curves through the points $\bar y \mp \sigma$, $m$,
where $\bar y$ is given by (27) and $\sigma$ by (28′). They are traced in the figure for n = 20.












































[Fig. 1. Control curve for uniform distribution. Abscissa: probability $y = W(x_m)$, from 0 to 1.0;
ordinate: m, from 0 to 20. Curves shown: mean mth point $\bar y$ (27); modal mth point (25);
Pearson’s mth point (23); control curves.]





It is interesting to calculate the area A bounded by these two curves and to
compare it with the area J. If we consider m as abscissa and $\bar y + \sigma$ and $\bar y - \sigma$ as
ordinates we have, for n sufficiently large,
$$A = \int_1^n [(\bar y + \sigma) - (\bar y - \sigma)]\,dm = 2\int_1^n \sigma\,dm.$$
If we introduce $\bar y$ as variable of integration we obtain from (28′)
$$A = \frac{2(n+1)}{\sqrt{n+2}}\int_{1/(n+1)}^{n/(n+1)} \sqrt{\bar y(1-\bar y)}\,d\bar y.$$
The transformation
$$\bar y = \sin^2 t, \qquad 1 - \bar y = \cos^2 t, \qquad d\bar y = 2\sin t\cos t\,dt,$$
leads, as is well known, to
$$\frac{\sqrt{n+2}}{n+1}\,A = \tfrac12\Big[t - \tfrac12\sin 2t\cos 2t\Big]_{t_0}^{t_1}.$$
The limits are given by
$$t_0 = \arcsin\sqrt{\frac{1}{n+1}}, \qquad t_1 = \arcsin\sqrt{1 - \frac{1}{n+1}}.$$
For the expansion of $t_0$ it is sufficient to put
$$t_0 = \arcsin\sqrt{\frac{1}{n+1}} = \frac{1}{\sqrt{n+1}} + \frac{1}{6(n+1)\sqrt{n+1}},$$
provided $(n+1)^2 \gg 1$. Under the same condition,
$$\arcsin\sqrt{1-x^2} = \arccos x = \tfrac12\pi - \arcsin x$$
becomes for any $|x| \ll 1$
$$\arcsin\sqrt{1-x^2} = \tfrac12\pi - x - \tfrac16 x^3.$$
Therefore
$$t_1 = \tfrac12\pi - \frac{1}{\sqrt{n+1}} - \frac{1}{6(n+1)\sqrt{n+1}},$$
so that
$$\tfrac12(t_1 - t_0) = \frac{\pi}{4} - \frac{1}{\sqrt{n+1}} - \frac{1}{6(n+1)\sqrt{n+1}}.$$
The second factor in the brackets becomes
$$\sin 2t_0\cos 2t_0 = 2\sin t_0\sqrt{1-\sin^2 t_0}\,(\cos^2 t_0 - \sin^2 t_0) = 2\sin t_0\sqrt{1-\sin^2 t_0}\,(1 - 2\sin^2 t_0)$$
$$= 2\sqrt{\frac{1}{n+1}\Big(1 - \frac{1}{n+1}\Big)}\,\Big(1 - \frac{2}{n+1}\Big) = \frac{2\sqrt{n}\,(n-1)}{(n+1)^2}.$$
In the same way
$$\sin 2t_1\cos 2t_1 = 2\sqrt{\frac{n}{n+1}\Big(1 - \frac{n}{n+1}\Big)}\,\Big(1 - \frac{2n}{n+1}\Big) = -\frac{2\sqrt{n}\,(n-1)}{(n+1)^2}.$$
Finally, we reach the area bounded by the control curves
$$A = \frac{n+1}{\sqrt{n+2}}\Big(\frac{\pi}{4} + \frac{(n-1)\sqrt{n}}{(n+1)^2} - \frac{1}{\sqrt{n+1}} - \frac{1}{6(n+1)\sqrt{n+1}}\Big). \tag{30}$$
According to (29) the ratio of the area bounded by the control curves to the area
of the cumulative histogram
$$\frac{A}{J} = \frac{n+1}{n-1}\,\frac{2}{\sqrt{n+2}}\Big(\frac{\pi}{4} + \frac{(n-1)\sqrt{n}}{(n+1)^2} - \frac{1}{\sqrt{n+1}} - \frac{1}{6(n+1)\sqrt{n+1}}\Big)$$
converges towards zero as $1 : \sqrt{n}$.
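As a numerical check on the area between the control curves, the Python sketch below integrates $2\sigma$ over $1 \le m \le n$ by Simpson's rule and compares the result with the series expression for A, taking the signs as they follow from the substitution $\bar y = \sin^2 t$. It is an illustration under those assumptions; the function names are hypothetical and Simpson's rule is a choice made here, not the paper's.

```python
import math


def sigma(m: float, n: int) -> float:
    """Standard deviation of the m-th point from (28'), with y_bar = m / (n + 1)."""
    y = m / (n + 1)
    return math.sqrt(y * (1.0 - y) / (n + 2))


def control_area_numeric(n: int, steps: int = 2000) -> float:
    """A = 2 * integral of sigma dm over 1 <= m <= n (Simpson's rule, steps even)."""
    width = (n - 1) / steps
    total = sigma(1, n) + sigma(n, n)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * sigma(1 + i * width, n)
    return 2.0 * total * width / 3.0


def control_area_series(n: int) -> float:
    """Series form of A obtained from the arcsin expansion of the limits."""
    root = math.sqrt(n + 1)
    bracket = (math.pi / 4 + (n - 1) * math.sqrt(n) / (n + 1) ** 2
               - 1 / root - 1 / (6 * (n + 1) * root))
    return (n + 1) / math.sqrt(n + 2) * bracket


def area_ratio(n: int) -> float:
    """Ratio A / J with J = (n - 1) / 2 from (29)."""
    return control_area_numeric(n) / (0.5 * (n - 1))


print(control_area_numeric(20), control_area_series(20), area_ratio(20))
```

For n = 20 the two values of A agree to about three decimal places; the small gap comes from truncating the arcsin expansion.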
















The properties (29′) and (30) of the cumulative histogram of the positions of
the mth points allow for the comparison of the ‘observations’ $W_0(x_m)$ and the ideal
points $m/(n+1)$. It will often be sufficient to inspect the deviations between the
‘observations’ and theory to judge which set of ‘observed’ points is closer,
on the whole, to the theoretical positions. It seems legitimate to prefer a hypothesis
$h_0$ if the control area contains more points for $h_0$ than for $h_1$.

In order to secure a numerical test, we can introduce the mean $S^2$ of the sum
of the squares of the differences between the ‘observed’ values $y_m = W_0(x_m)$ and
the mean positions $\bar y_m = m/(n+1)$. Take
$$S^2 = \frac{k}{n}\sum_{m=1}^{n}\Big(y_m - \frac{m}{n+1}\Big)^2, \tag{31}$$
where the value of the constant k will be specified later. One extreme for (31)
would be to have the theory hold for every point. Then the value of the sum
would be zero. The other extremes would be when all points are concentrated
either at the origin, zero, or at the end, 1. In the first case
$$\frac{S^2}{k} = \frac{1}{n(n+1)^2}\sum_{m=1}^{n} m^2 = \frac{2n+1}{6(n+1)}.$$
The second case leads to the same value, since $\sum m = \tfrac12 n(n+1)$ and therefore
$$\frac{1}{n}\sum_{m=1}^{n}\Big(1 - \frac{m}{n+1}\Big)^2 = 1 - 1 + \frac{1}{n(n+1)^2}\sum_{m=1}^{n} m^2.$$
Therefore
$$0 \le S^2 \le \frac{(2n+1)\,k}{6(n+1)}. \tag{32}$$


In order to draw conclusions from an observed value $S^2$ we have to calculate its
expectation $\overline{S^2}$. We will determine k in such a way that $\overline{S^2}$ is independent of n.
A test similar to (31), but serving another purpose, has been introduced for
the usual distributions by H. Cramér (1928) and R. von Mises (1931). When
applied to uniform distributions, this $\omega^2$ test leads to the use of the position given by (23) instead
of the mean value $\bar y_m$. For this test the sum of the deviations is zero which does
not hold in our case. The expectation $\overline{S^2}$ of $S^2$ is
$$\overline{S^2} = \frac{k}{n}\sum_{m=1}^{n}\Big[\overline{y_m^2} - \frac{2m\,\bar y_m}{n+1} + \frac{m^2}{(n+1)^2}\Big].$$
The first two sums are obtained from (27) and (26). Therefore
$$\overline{S^2} = \frac{k}{n}\Big[\frac{\sum_{m=1}^{n} m(m+1)}{(n+1)(n+2)} - \frac{\sum_{m=1}^{n} m^2}{(n+1)^2}\Big] = \frac{k}{n(n+1)}\Big[\frac{\sum_{m=1}^{n} m}{n+2} - \frac{\sum_{m=1}^{n} m^2}{(n+1)(n+2)}\Big].$$




The introduction of the sum of the powers of the natural numbers leads to
$$\overline{S^2} = \frac{k}{2(n+2)}\Big(1 - \frac{2n+1}{3n+3}\Big) = \frac{k}{6(n+1)}.$$
Taking k = 6(n+1) we propose therefore, as test of a hypothesis $h_0$, the coefficient
$$S^2 = \frac{6(n+1)}{n}\sum_{m=1}^{n}\Big(y_m - \frac{m}{n+1}\Big)^2, \tag{33}$$
which, according to (32), can assume values between zero and 2n + 1, and has for
expectation the value
$$\overline{S^2} = 1. \tag{34}$$
Of two competing hypotheses, the one with the smaller value of $S^2$ is to be
preferred.

The $S^2$ criterion does not introduce any arbitrary classification. It makes
use of all observations. Besides the probabilities $W(x_m)$ corresponding to the
observed values $x_m$, no new calculations are needed. The test has a clear meaning


and its application is simple. This is due to the fact that it is a natural consequence 
of the probability integral transformation. 
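The coefficient (33) is simple to compute once the probabilities $W(x_m)$ are available. The Python sketch below is a minimal illustration, with the hypothetical name `s2_coefficient`; it assumes the input is the list of probabilities under the hypothesis being tested, sorts them, and applies (33) with k = 6(n + 1). The two extreme cases reproduce the bounds implied by (32), namely 0 and 2n + 1.

```python
def s2_coefficient(probabilities):
    """Coefficient (33): 6(n+1)/n * sum over m of (y_m - m/(n+1))^2.

    `probabilities` are the values W(x_m); they are sorted here so that the
    m-th smallest is compared with its mean position m / (n + 1).
    """
    y = sorted(probabilities)
    n = len(y)
    total = sum((ym - m / (n + 1)) ** 2 for m, ym in enumerate(y, start=1))
    return 6.0 * (n + 1) / n * total


n = 10
ideal = [m / (n + 1) for m in range(1, n + 1)]
print(s2_coefficient(ideal))        # theory holds at every point
print(s2_coefficient([0.0] * n))    # all points at the origin
print(s2_coefficient([1.0] * n))    # all points at the end
```

Since the expectation of the coefficient is 1 under the hypothesis, observed values can be read directly on that scale when two hypotheses are compared.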


SUMMARY 


We propose the following procedure for testing statistical hypotheses: The
constants for competing hypotheses, having the same number of constants, are
determined in such a way that their precisions are approximately the same. Then
we calculate the probabilities $W_0(x_m)$, $W_1(x_m)$, ..., and their respective control
curves. We trace $W(x_m)$, $\bar y_m \pm \sigma_m$, and compare it with the observed frequency
curve. If neither the classical tests nor the control curves indicate a clear superiority
of one of the hypotheses we consider the probabilities as ‘observations’
and plot the corresponding points on the y axis. We now compare, by formulae
(9)–(17), the ‘observations’ with the theoretical uniform distribution and apply
the $\chi^2$ and entropy test of formulae (18) and (21), respectively. If necessary, we
repeat these tests in such a way that the actual position of each point is taken into
account. Formula (18′) gives a value of $\chi^2$ which is independent of the classification.
Then we plot the cumulative frequency of the ‘observations’, which is
compared with the straight line (27) and controlled by the values given in Table 2.
The final test is given by (33).







REFERENCES 


CRAMÉR, H. (1928). On the composition of elementary errors. Skand. Aktuar. Tidskr.
11, 13-17, 141-80.
GUMBEL, E. J. (1935). Les valeurs extrêmes des distributions statistiques. Ann. Inst.
Henri Poincaré, 4, fasc. 2, 115-58.
—— (1936). La précision de la moyenne arithmétique et de la médiane. Aktuar. Vedy,
6, no. 4, 145-54.
—— (1937). Les intervalles extrêmes entre les émissions radioactives. I. J. Phys.
Radium, série vii, 8, no. 8, pp. 321-29; II, 446-52.
—— (1939). Les valeurs de position d’une variable aléatoire. C.R. Acad. Sci., Paris,
208, 147-9.
MISES, R. VON (1931). Wahrscheinlichkeitsrechnung, 316-35. Leipzig.
NEYMAN, J. (1937). ‘Smooth test’ for goodness of fit. Skand. Aktuar. Tidskr. 149-99.
NEYMAN, J. & PEARSON, E. S. (1928). On the use and interpretation of certain test criteria
for purposes of statistical inference. Biometrika, 20A, 263-94.
PEARSON, E. S. (1938). The probability integral transformation for testing goodness of fit
and combining independent tests of significance. Biometrika, 30, 134-48.
PEARSON, KARL (1933). On a method of determining whether a sample of size n supposed
to have been drawn from a parent population having a known probability integral
has probably been drawn at random. Biometrika, 25, 379-410.
—— (1934). On a new method of determining goodness of fit. Biometrika, 26, 425-42.
PEARSON, KARL & PEARSON, M. V. (1931, 1932). On the mean character and variance of a
ranked individual, and on the mean and variance of the intervals between ranked
individuals. Part I: Biometrika, 23, 364-97. Part II: Biometrika, 24, 203-79.













THE RANGE IN RANDOM SAMPLES 
By H. O. HARTLEY 


1. INTRODUCTION 


If the observations $x_\nu$ ($\nu = 1, 2, \ldots, n$) of a random sample are arranged in
ascending order of magnitude ($x_{\nu+1} \ge x_\nu$) the range w in such samples is defined as
the distance between the two extreme observations
$$w = x_n - x_1.$$

It may therefore be regarded as a measure of the variability or dispersion among 
the observations of the sample. Theoretically its efficiency in the sense defined by 
R. A. Fisher is, as a rule, much inferior to that of the standard deviation. More- 
over, extensive investigations have shown that its random sampling distribution 
is markedly dependent on the parental population (E. S. Pearson, 1926). For
large samples $x_\nu$ drawn from a parental distribution f(x) the extreme values $x_1$
and $x_n$ will lie right inside the lower and upper tail of f(x), and in practice it is
only in exceptional cases that the exact shape of f(x) has been established to such
a degree of accuracy that the resulting distribution of w can be trusted for large n.
In most cases the use of the range must therefore be limited to small samples,
say with $2 \le n \le 20$.

Large numbers of small samples may often be used with advantage when the 
mean range is calculated as an estimate of the standard deviation of the popula- 
tion (Pearson & Haines, 1935). Although theoretically such an estimate is not 
efficient and certainly not sufficient, it is nevertheless of considerable importance 
in many fields of application because of its simplicity. Statistical control charts 
in industrial quality control make extensive use of it, and more recently the 
range has been applied to investigations in gunnery. 

In some fields of application a disadvantage may arise from the fact that the 
range is an inexact statistic; its random sampling distribution depends on the 
standard deviation of the parent. This applies in particular to the analysis of 
small samples in biological experiments. The tendency of modern small sample 
theory has been to replace such statistics by what are called exact statistics, 
obtained by substituting for the unknown standard deviation of the parent an 
estimate calculated from an independent sample. This particular process of 
reaching exact statistics has sometimes been referred to as ‘Studentization’. 
A general theory of this process will be given in a further paper which it is hoped 
to publish in this journal, where it will be shown how estimates of scale parameters 
in general, and of the range in particular, may be converted into exact statistics. 
In this paper, however, we deal with the case where the standard deviation of the 
parent is known. Indeed, it is this dependence of the random sampling distribu- 
tion of range on the scale parameter of the parent which makes it possible to 
estimate its efficiency as an estimate thereof.








H. O. Hartley 335 


The question of grouping has been a subject of investigation in the case of 
the sample standard deviation; we shall here deal with the effect of grouping on 
the range, a problem which has so far received, we believe, no attention what- 
soever.* As practical examples of the occurrence of grouping we may quote 
three instances: 

(a) The rounding off of data for convenience of recording and analysis. 

(b) The recording of data to the nearest unit of measurement. Where the 
technique is of low accuracy (see e.g. Tildesley, 1940) the unit of measurement 
will be comparable in magnitude with the standard deviation of the actual data. 

(c) The analysis of data which are classified in categories. In such cases we 
may often find that the original data are unobtainable so that group frequencies 
are the only material available for an analysis. 

It will also be shown how the random sampling distribution of the range in 
grouped samples provides a suitable approach to that of the true range (ungrouped 
range in sample) on which extensive work has been done in earlier papers published 
in Biometrika. The mathematical formulae developed in this paper make this 
complex distribution amenable to a tabulation. For the case of normal samples, 
the work has actually been carried out and the resulting tables of the probability 
integral are given and discussed elsewhere in the present issue of this journal. 


2. THE DISTRIBUTION OF THE RANGE IN A GROUPED SAMPLE 


Let us denote by $x_1, \ldots, x_n$ the observations in a random sample drawn from
the parental distribution f(x)† and arranged in ascending order of magnitude.
This sample is now classified in groups or categories of constant length h with
equidistant end-points
$$\ldots,\ \xi - ih,\ \ldots,\ \xi - h,\ \xi,\ \xi + h,\ \ldots,\ \xi + kh,\ \ldots \tag{1}$$
covering the whole x scale from $-\infty$ to $+\infty$. Let us denote by $\xi_n$ and $\xi_1$ the
respective centres of the categories containing $x_n$ and $x_1$. Then the problem is to
find the random sampling distribution of the range in a grouped sample, i.e. of
$\xi_n - \xi_1$. The mean of this distribution is of particular interest. Obviously this
statistic can only assume values which are multiples of the group interval h and
is therefore discontinuous. Like the distribution of the ‘ungrouped’ or true range
it depends on the standard deviation $\sigma$ of the parent f(x). In addition, it depends

* The effect of grouping seems to be of some importance in researches on the technique of
anthropological measurement (Tildesley, 1940), where some of the results given below have already
been applied before this paper had gone to press.

† We shall deal here with a parental population represented by a ‘piecewise continuous’ distribution
function f(x). A function is called ‘piecewise continuous’ for $-\infty < x < +\infty$ if in any
closed interval of x the function f(x) is continuous apart from a finite number of ordinary
discontinuities. If the actual range of the variate is bounded we simply define f(x) = 0 outside this
range. Moreover, we assume that f(x) has contact of at least second order at $\pm\infty$. It is easy to see
how our results may be generalized to cover distribution functions with singularities.











336 The range in random samples 


on the category width h and on the position of the category midpoints relative to
the population mean X. Of these parameters only h will in practice be known.
Methods to eliminate $\sigma$ are to be given in a separate paper whilst the elimination
of X is dealt with in the section on randomized grouping (6).
It will be convenient to use the following notation:
$$\int_i^k = \int_{\xi+ih}^{\xi+kh} f(x)\,dx, \qquad \int_{-\infty}^{i} = \int_{-\infty}^{\xi+ih} f(x)\,dx, \qquad \int_i^{\infty} = \int_{\xi+ih}^{\infty} f(x)\,dx.$$
Let us now find the chance that $\xi_n - \xi_1$ is at most (m−1)h, and that in addition
$$\xi_1 = \xi + (i+\tfrac12)h$$
for a particular value of i. This chance is given by
$$\Big(\int_i^{i+m}\Big)^n - \Big(\int_{i+1}^{i+m}\Big)^n. \tag{2}$$
The first term in (2) represents the probability for all $x_\nu$ to lie between $\xi + ih$ and
$\xi + (i+m)h$. From this we have to deduct the chance for all $x_\nu$ to lie between
$\xi + (i+1)h$ and $\xi + (i+m)h$ which is given by the second term of (2). In taking
the difference we are therefore left with the chance for the occurrence of a sample
completely contained in the interval $\xi + ih$ to $\xi + (i+m)h$ but with at least one
of the $x_\nu$ lying between $\xi + ih$ and $\xi + (i+1)h$. This proves that (2) represents
the required chance. Now, since all samples may be classified with regard to
their lowest category, the probability for $\xi_n - \xi_1$ to be at most (m−1)h is given
by summation over all i of the expression (2). If we denote this probability by
$P(n, h, m-1, \xi)$ we find
$$P(n, h, m-1, \xi) = \sum_{i=-\infty}^{+\infty}\Big[\Big(\int_i^{i+m}\Big)^n - \Big(\int_{i+1}^{i+m}\Big)^n\Big]. \tag{3}$$


With equation (3) we have reached a formal representation of the random
sampling distribution of $\xi_n - \xi_1$. Its evaluation is a simple matter for large group
intervals h and for parental distributions f(x) with a tabulated probability integral
$\int f(x)\,dx$. If we were to take the trouble of tabulating the probability integral
(3) we should obtain the mean of $\xi_n - \xi_1$ as a by-product from a summation of (3).
It will be shown in the next section that this summation, if carried out analytically,
produces a very simple formula for this mean.
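To make the summation in (3) concrete, the following Python sketch evaluates $P(n, h, k, \xi)$, the chance that the grouped range is at most $kh$, for a standard normal parent, and then obtains the mean grouped range as a by-product. This is an illustration only: the function names, the truncation of the infinite sum at about 12 standard deviations, and the use of `math.erf` for the normal probability integral are assumptions made here, not part of the paper.

```python
import math


def normal_cdf(x: float) -> float:
    """Probability integral of the standard normal distribution."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def prob_grouped_range_at_most(k, n, h, xi, cdf=normal_cdf, span=12.0):
    """P(n, h, k, xi) from (3), written with m = k + 1.

    The sum over i is truncated where the parent has negligible mass
    (an assumption suitable for the unit normal).
    """
    m = k + 1
    i_max = int(span / h) + m + 1
    total = 0.0
    for i in range(-i_max, i_max + 1):
        upper = cdf(xi + (i + m) * h)
        total += (upper - cdf(xi + i * h)) ** n - (upper - cdf(xi + (i + 1) * h)) ** n
    return total


def mean_from_distribution(n, h, xi, k_max=40):
    """Mean grouped range as h * sum over k of (1 - P(n, h, k, xi))."""
    return h * sum(1.0 - prob_grouped_range_at_most(k, n, h, xi) for k in range(k_max))


print(prob_grouped_range_at_most(2, 5, 1.0, 0.0))
print(mean_from_distribution(5, 1.0, 0.0))
```

The mean is obtained here from the identity that a non-negative integer variable has expectation equal to the sum of its upper tail probabilities.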


3. THE MEAN RANGE IN A GROUPED SAMPLE 


To find the mean of the distribution it is convenient to extend the summation
in (3) from some finite negative value i = −j up to $+\infty$. By choosing j sufficiently
large the resulting error may be made negligible. We introduce
$$p_{-j}(m-1) = \sum_{i=-j}^{\infty}\Big[\Big(\int_i^{i+m}\Big)^n - \Big(\int_{i+1}^{i+m}\Big)^n\Big] \tag{4}$$











and find for the difference between P(m−1) and $p_{-j}(m-1)$*
$$|P(m-1) - p_{-j}(m-1)| \le \sum_{i=-\infty}^{-j} n\int_i^{i+1} = n\int_{-\infty}^{-j+1}, \tag{5}$$


for all m and j. To find the mean of $\xi_n - \xi_1$ we must first note the probability for
this statistic to be exactly equal to mh, where m = 0, 1, 2, .... Denoting this
probability by $\phi(m)$ we have from the definition of P(m)
$$\phi(m) = P(m) - P(m-1).$$
If we denote the mean of $\xi_n - \xi_1$ by $\Xi$ we have by definition
$$\Xi = h\sum_{k=0}^{\infty}\phi(k)\,k = h\lim_{m\to\infty}\{(m+1)P(m) - S_m\}, \tag{6}$$
where $S_m = (m+1)\phi(0) + m\phi(1) + \ldots + \phi(m)$
or $S_m = P(0) + P(1) + \ldots + P(m)$. (7)


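To find $\Xi$ let us first consider the second term in formula (6). We have from
equations (7) and (5)
$$S_m = p_{-j}(0) + \ldots + p_{-j}(m) + \epsilon_1, \tag{8}$$
where
$$|\epsilon_1| < n(m+1)\int_{-\infty}^{-j+1}, \tag{9}$$
for all j and m. Now from the definition of $p_{-j}(k)$ we find
$$\sum_{k=0}^{m} p_{-j}(k) = \sum_{k=0}^{m}\sum_{i=-j}^{\infty}\Big[\Big(\int_i^{i+k+1}\Big)^n - \Big(\int_{i+1}^{i+k+1}\Big)^n\Big]$$
$$= \sum_{i=-j+1}^{\infty}\Big(\int_i^{i+m+1}\Big)^n + \sum_{k=0}^{m}\Big(\int_{-j}^{-j+k+1}\Big)^n.$$
Putting now m = 2j, we have
$$\sum_{k=0}^{2j} p_{-j}(k) = \sum_{i=-j+1}^{j+1}\Big(\int_{-\infty}^{i}\Big)^n + \sum_{i=-j+1}^{j}\Big(\int_i^{\infty}\Big)^n + \epsilon_2 + \epsilon_3, \tag{10}$$
where it is easily seen that
$$|\epsilon_2| < (2j+1)\,n\int_{-\infty}^{-j}, \qquad |\epsilon_3| < 2nj\int_{j+2}^{\infty} + \sum_{i=j+1}^{\infty}(i-j)\int_i^{i+1}. \tag{11}$$
Finally, we want to replace in formula (6) the first term
$$(2j+1)P(2j) \quad\text{by}\quad (2j+1). \tag{12}$$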

* In this section we deal with fixed group intervals and a fixed sample size n so that we drop
the arguments n, h and $\xi$.








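The resulting error is easily estimated. We have from (5)
$$(2j+1)(1 - P(2j)) = (2j+1)(1 - p_{-j}(2j)) + \epsilon_4,$$
where
$$|\epsilon_4| < (2j+1)\,n\int_{-\infty}^{-j+1}. \tag{13}$$
Moreover, according to the definition (4) we may write
$$(2j+1)\,p_{-j}(2j) = (2j+1)\sum_{i=-j}^{\infty}\Big[\Big(\int_i^{i+2j+1}\Big)^n - \Big(\int_{i+1}^{i+2j+1}\Big)^n\Big]$$
$$= (2j+1)\Big\{\Big(\int_{-j}^{j+1}\Big)^n + \sum_{i=-j}^{\infty}\Big[\Big(\int_{i+1}^{i+2j+2}\Big)^n - \Big(\int_{i+1}^{i+2j+1}\Big)^n\Big]\Big\}$$
$$= (2j+1)\Big(\int_{-j}^{j+1}\Big)^n + \epsilon_5,$$
where
$$|\epsilon_5| < (2j+1)\sum_{i=-j}^{\infty} n\int_{i+2j+1}^{i+2j+2} = n(2j+1)\int_{j+1}^{\infty}, \tag{14}$$
so that finally we have
$$(2j+1)(1 - p_{-j}(2j)) = -\epsilon_5 + \epsilon_6$$
with
$$|\epsilon_6| < (2j+1)\,n\Big(\int_{j+1}^{\infty} + \int_{-\infty}^{-j}\Big). \tag{15}$$
The error terms $\epsilon_1, \epsilon_2, \epsilon_3, \epsilon_4, \epsilon_5$ and $\epsilon_6$ are of the form
$$c\,j\int_j^{\infty}, \qquad c\,j\int_{-\infty}^{-j} \qquad\text{or}\qquad \sum_{i=j}^{\infty}(i-j)\int_i^{i+1}.$$
It is easy to see that the above terms tend to 0 as $j \to \infty$. To prove this for the
first term we write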

<P s)f<efAfeme 


which tends to 0 as f(x) has contact of order 1 at $+\infty$. The proof for the other
terms is identical. For sufficiently large j we can therefore use the approximations
given by (10) and (12) and transform equation (6) into the convenient form
$$\Xi = h\lim_{j\to\infty}\Big\{(2j+1) - \sum_{i=-j+1}^{j+1}\Big(\int_{-\infty}^{i}\Big)^n - \sum_{i=-j+1}^{j}\Big(\int_i^{\infty}\Big)^n\Big\}. \tag{16}$$
Equation (16) gives the mean range $\Xi$ in a grouped sample in terms of powers of
the probability integral of the parental population. For a normal distribution
this is a particularly simple formula since such powers have already been calculated
by L. H. C. Tippett (1925) and are conveniently tabulated in Table XXI
of the Tables for Statisticians and Biometricians, Part II. A table of $\Xi$ can, therefore,
be easily computed by adding a few entries from Tippett’s table and
deducting the sum from the appropriate value of 2j + 1.

This has been done for samples of five, ten and twenty observations grouped
in categories of breadth h (see the table below). The parameter $\xi$ denotes the
distance of the population mean X = 0 from the nearest group end-point. For
given h the mean range $\Xi$ in grouped samples is obviously a symmetrical periodic
function of $\xi$ with period h. The table has been extended to cover rather coarse
grouping intervals (h = 2.2σ) in order to illustrate the possible bias of range when
estimated from frequency tables with as few as two or three categories. It is apparent
that for small or moderate group intervals, say h < σ, the mean range is practically
independent of h and $\xi$, so that no correction (corresponding to the well-known
Sheppard’s correction for the sample standard deviation) is required for the
range. For h = 0.2σ the mean range in the grouped sample agrees with the theoretical
ungrouped range to five places of decimals (see Table of Mean Range,
Table XXII of Tables for Statisticians and Biometricians, Part II). For coarse
grouping the correction becomes important but depends on $\xi$ (as well as on h).
For fixed h, as $\xi$ varies from $-\tfrac12 h$ to $+\tfrac12 h$ the grouped mean range oscillates about
the true mean range as a smooth single-period function. The reason for this is
obvious. If $\xi$ has a position such that the average positions of $x_1$ and $x_n$ both
happen to fall within the outside halves of two group intervals, then $\xi_n - \xi_1$ will,
on the average, be smaller than $x_n - x_1$; vice versa, if the average positions of
$x_1$ and $x_n$ are in the inside halves of two group intervals, there will be a predominance
of samples for which $\xi_n - \xi_1$ is larger than $x_n - x_1$.

Table of mean range in grouped samples drawn from a normal
population having unit standard deviation

Size of sample = n. Width of group interval = h.
Distance of population mean to nearest group end-point = $\xi$.

   h      $\xi$      n = 5      n = 10     n = 20
  0.2    0.0     2.32593    3.07751    3.73495

  0.6    0.0     2.32593    3.07750    3.73500
         0.2     2.32593    3.07751    3.73492

  1.0    0.0     2.32532    3.08122    3.72917
         0.2     2.32574    3.07865    3.73317
         0.4     2.32642    3.07450    3.73962

  1.4    0.0     2.31042    3.06204    3.82069
         0.2     2.31626    3.06787    3.78826
         0.4     2.32938    3.08095    3.71575
         0.6     2.33990    3.09143    3.65796

  1.8    0.0     2.29227    2.90539    3.67974
         0.2     2.30022    2.94491    3.69553
         0.4     2.32023    3.04621    3.73085
         0.6     2.34276    3.16356    3.76263
         0.8     2.35734    3.24140    3.77838

  2.2    0.0     2.36011    2.77080    3.27509
         0.2     2.35552    2.81669    3.34611
         0.4     2.34221    2.94307    3.53896
         0.6     2.32265    3.11581    3.79662
         0.8     2.30255    3.28173    4.03847
         1.0     2.28963    3.38358    4.18452
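Entries of the table can be spot-checked with a few lines of Python. The sketch below is an illustration under stated assumptions rather than the computation actually used for the table: it sums $1 - F^n - (1-F)^n$ over the group end-points $\xi + ih$, where F is the normal probability integral, which is a symmetric way of writing the bracket in (16) once terms far from the mean are seen to vanish. The function names and the truncation at about 12 standard deviations are hypothetical choices.

```python
import math


def normal_cdf(x: float) -> float:
    """Probability integral of the standard normal distribution."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def mean_grouped_range(n, h, xi, cdf=normal_cdf, span=12.0):
    """Mean grouped range: h * sum over end-points of 1 - F^n - (1 - F)^n."""
    j = int(span / h) + 1
    total = 0.0
    for i in range(-j, j + 1):
        F = cdf(xi + i * h)
        total += 1.0 - F ** n - (1.0 - F) ** n
    return h * total


for h, xi in [(0.2, 0.0), (1.0, 0.4), (2.2, 0.0), (2.2, 1.0)]:
    values = [mean_grouped_range(n, h, xi) for n in (5, 10, 20)]
    print(h, xi, [round(v, 5) for v in values])
```

For fine grouping the sum behaves like a numerical integration of $1 - F^n - (1-F)^n$, which is the mean of the ungrouped range, so the agreement at h = 0.2 is to be expected.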

Moreover, as h increases, the grouped mean range becomes a less reliable
estimate, and it can be shown that for h > σ the standard deviation of the
random sampling distribution of the grouped range rapidly increases with h.

We are thus led to consider two problems; one is the elimination of the parameter
$\xi$ (or the dependence of the distribution on the position of the parental
population mean); the other is to investigate more closely the random sampling
distribution of the grouped range. Before dealing with these problems, however,
we must first consider the distribution of the true range (range in the ungrouped
sample).


4. THE PROBABILITY INTEGRAL OF THE RANGE IN RANDOM SAMPLES 


As before we denote by $x_1, \ldots, x_n$ the observations in a random sample drawn
from a parental distribution f(x) and arranged in ascending order of magnitude.
The range in such a sample, defined as $w = x_n - x_1$, may be regarded as the limit
of the grouped range $\xi_n - \xi_1$ as h, the group interval, tends to 0, i.e.
$$w = \lim_{h\to 0}(\xi_n - \xi_1).$$
The probability integral of the range w, denoted by $P_n(W)$, is therefore the limit
of $P(n, h, m-1, \xi)$, given by (3), as h tends to 0. To obtain this limit we write
equation (3) as follows:
$$P(n, h, m-1, \xi) = \sum_{i=-\infty}^{+\infty}\Big[\Big(\int_{\xi+ih}^{\xi+(i+m)h} f(x)\,dx\Big)^n - \Big(\int_{\xi+(i+1)h}^{\xi+(i+m)h} f(x)\,dx\Big)^n\Big]$$
$$= \sum_{i=-\infty}^{+\infty} n\,h\,f(\xi_i)\Big(\int_{\xi_i}^{\xi+(i+m)h} f(x)\,dx\Big)^{n-1},$$
where $\xi_i$ is some mean value between $\xi + ih$ and $\xi + (i+1)h$.
We now put m = W/h or W = mh,
and let h tend to 0, m to $\infty$, keeping W constant. We obtain without difficulty
$$P_n(W) = \lim_{h\to 0,\ m\to\infty} P(n, h, m-1, \xi) = n\int_{-\infty}^{+\infty} f(\xi)\Big(\int_{\xi}^{\xi+W} f(x)\,dx\Big)^{n-1} d\xi, \tag{17}$$
which is the required probability integral of the range. This integral may be
compared with the expression for the distribution function of w which was given
by A. T. McKay & E. S. Pearson (1933). It is easily verified that the function
$\phi(w)$ given by these authors is the differential of P(W). The expression for P(W)










is decidedly simpler than that for $\phi(w)$ which was used by Pearson for numerical
work on this function. However, even P(W) is of a complex character, and only
in special cases is it possible to evaluate it analytically. For the rectangular
distribution function (f(x) = 1, $0 \le x \le 1$) this can be done easily.
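For the rectangular case the integral (17) can be carried out directly: with f(x) = 1 on (0, 1) the inner integral equals W for $\xi \le 1 - W$ and $1 - \xi$ beyond, which gives $P_n(W) = nW^{n-1} - (n-1)W^n$ for $0 \le W \le 1$. The Python sketch below is an illustration, not taken from the paper; it compares this closed form with a midpoint-rule evaluation of (17). The function names and the choice of quadrature are assumptions.

```python
def range_cdf_rectangular(W: float, n: int) -> float:
    """Closed form of (17) for the rectangular parent on (0, 1)."""
    if W <= 0.0:
        return 0.0
    if W >= 1.0:
        return 1.0
    return n * W ** (n - 1) - (n - 1) * W ** n


def range_cdf_from_17(W: float, n: int, steps: int = 4000) -> float:
    """Midpoint-rule evaluation of (17) with f = 1 on (0, 1)."""
    width = 1.0 / steps
    total = 0.0
    for s in range(steps):
        xi = (s + 0.5) * width
        inner = min(xi + W, 1.0) - xi  # integral of f from xi to xi + W
        total += n * inner ** (n - 1) * width
    return total


print(range_cdf_rectangular(0.5, 5), range_cdf_from_17(0.5, 5))
```

The same quadrature pattern applies to any parent with a known probability integral, by replacing the `inner` line with a difference of that integral.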


5. THE PROBABILITY INTEGRAL OF THE RANGE IN SAMPLES 
FROM A NORMAL POPULATION 


Of particular interest is the case where the parental population is normal.
In this case we have
$$f(x) = z(x) = (2\pi)^{-1/2}\,e^{-\frac12 x^2}. \tag{18}$$
L. H. C. Tippett (1925), E. S. Pearson (1926, 1932) and A. T. McKay & E. S. Pearson
(1933) have considered this problem and carried out extensive numerical work.
The method adopted was to calculate correct values of the means and standard
deviations of the distributions (as functions of n) and then, with the help of approximate
values of $\beta_1$ and $\beta_2$, use as approximations to the unknown true distribution
Pearson-type curves fitted by the method of moments. The numerical results,
although they have been successfully tested by experimental sampling, have,
of course, an unknown accuracy. It is therefore desirable to find a method which
produces $P_n(W)$ to known and sufficient accuracy.


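We have
$$P_n(W) = n\int_{-\infty}^{+\infty} z(\xi)\Big(\int_{\xi}^{\xi+W} z(x)\,dx\Big)^{n-1} d\xi = n\int_{-\infty}^{-\frac12 W} + n\int_{-\frac12 W}^{+\infty} = I_1 + I_2 \ \text{(say)}.$$
Writing $\eta = -(\xi + W)$, $-\xi = \eta + W$, we obtain
$$P_n(W) = n\int_{-\frac12 W}^{\infty} z(-\eta - W)\Big(\int_{-(\eta+W)}^{-\eta} z(x)\,dx\Big)^{n-1} d\eta + I_2.$$
Using the symmetry of z(x) we may write
$$P_n(W) = n\int_{-\frac12 W}^{\infty} z(\eta + W)\Big(\int_{\eta}^{\eta+W} z(x)\,dx\Big)^{n-1} d\eta + I_2,$$
and writing $\xi$ as a variable of integration in place of $\eta$ we find
$$P_n(W) = n\int_{-\frac12 W}^{\infty}[z(\xi + W) + z(\xi)]\Big(\int_{\xi}^{\xi+W} z(x)\,dx\Big)^{n-1} d\xi,$$
or
$$P_n(W) = -n\int_{-\frac12 W}^{\infty}[z(\xi + W) - z(\xi)]\Big(\int_{\xi}^{\xi+W} z(x)\,dx\Big)^{n-1} d\xi + 2n\int_{-\frac12 W}^{\infty} z(\xi + W)\Big(\int_{\xi}^{\xi+W} z(x)\,dx\Big)^{n-1} d\xi. \tag{19}$$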


Integrating the first integral and introducing $u = \xi + W$ in the second integral
we finally obtain
$$P_n(W) = \Big(\int_{-\frac12 W}^{+\frac12 W} z(x)\,dx\Big)^n + 2n\int_{\frac12 W}^{\infty} z(u)\Big(\int_{u-W}^{u} z(x)\,dx\Big)^{n-1} du. \tag{20}$$
For large values of W this is an approximate solution of the problem since the
second term in (20) is small, so that the first term
$$\Big(\int_{-\frac12 W}^{+\frac12 W} z(x)\,dx\Big)^n$$
gives a fair approximation to $P_n(W)$. This expression denotes the chance of
observing samples with observations all lying between $-\tfrac12 W$ and $+\tfrac12 W$; all
these samples have a range smaller than or equal to W. For large W it is these
samples which constitute an ever-increasing proportion of the total number of
samples with range $\le W$.

The second term in (20), which is always positive, takes into account all those
samples which are not contained in the interval $-\tfrac12 W$ to $+\tfrac12 W$. This term cannot
be ignored if high accuracy is required and if W is small or moderate. Nevertheless,
the work involved in the numerical integration has been considerably
reduced, for the range of integration is now from $+\tfrac12 W$ to $+\infty$.

The numerical integration of
$$\int_{\frac12 W}^{\infty} z(u)\Big(\int_{u-W}^{u} z(x)\,dx\Big)^{n-1} du$$
is best carried out simultaneously for values of n forming an arithmetical progression.
For fixed u and W, the integrand is then given by the terms of a geometrical
progression with, say,
$$z(u)\Big(\int_{u-W}^{u} z(x)\,dx\Big)$$
as first term and
$$\Big(\int_{u-W}^{u} z(x)\,dx\Big)$$
as common ratio. Such a geometrical progression can be produced automatically
by certain modern calculating machines. This forms the basic idea of the actual
computation of the probability integral which is described in detail in another
paper (pp. 309-10 above).
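Formula (20) lends itself to direct numerical evaluation. The Python sketch below is a modern illustration, not the tabulation procedure described in the companion paper: it evaluates the first term from the normal probability integral and the second by Simpson's rule over a finite stretch beyond $\tfrac12 W$. The function names, the cut-off `tail` and the number of steps are assumptions. For n = 2 the result can be checked against the known distribution of the difference of two normal variates, for which $P_2(W) = \mathrm{erf}(\tfrac12 W)$.

```python
import math


def z(x: float) -> float:
    """Standard normal ordinate, as in (18)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def normal_cdf(x: float) -> float:
    """Probability integral of the standard normal distribution."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def range_cdf_normal(W: float, n: int, steps: int = 2000, tail: float = 10.0) -> float:
    """P_n(W) from (20); `steps` must be even for Simpson's rule."""
    if W <= 0.0:
        return 0.0
    first = (normal_cdf(0.5 * W) - normal_cdf(-0.5 * W)) ** n
    a = 0.5 * W
    width = tail / steps

    def integrand(u: float) -> float:
        return z(u) * (normal_cdf(u) - normal_cdf(u - W)) ** (n - 1)

    total = integrand(a) + integrand(a + tail)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(a + i * width)
    second = 2.0 * n * total * width / 3.0
    return first + second


print(range_cdf_normal(1.0, 2), math.erf(0.5))
print([round(range_cdf_normal(w, 5), 5) for w in (1.0, 2.0, 3.0, 4.0)])
```

Computing several n at once, as the text suggests, would amount to reusing the inner difference of probability integrals at each u and raising it to successive powers.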


6. THE RANGE IN RANDOMLY GROUPED SAMPLES 


The results of sections (2) and (3) on the effect of grouping on the distribution
of range depend on the parameter $\xi$, which denotes the origin of the equidistant
set of group end-points
$$\xi + ih \qquad (i = 0, \pm 1, \pm 2, \ldots),$$
given by equation (1). In practice, however, all we know is the category breadth, h.

















We then select the actual group end-points $\xi + ih$ from considerations which
are, by necessity, independent of the position of the mean X of the parental
population f(x), because this position will generally be unknown. One of our
group end-points, however, is bound to fall into the interval
$$X - \tfrac12 h \quad\text{to}\quad X + \tfrac12 h,$$
wherever this interval may lie. Now, since the origin $\xi$ in our system of group
end-points is wholly arbitrary we may assume that for given h we have the
inequality
$$X - \tfrac12 h \le \xi \le X + \tfrac12 h.$$
The fact that our group end-points (and therefore their origin $\xi$) are chosen independently
of X has now to be expressed in mathematical terms. This is done
by assuming that we are dealing with a population of values of $\xi$ (being the origins
of corresponding systems of group end-points) which are rectangularly distributed
in the interval $X - \tfrac12 h \le \xi \le X + \tfrac12 h$. This condition is exactly fulfilled where
grouping has been introduced through rounding off of data (example (a) on
p. 335), and it is often an appropriate assumption in the other examples, as in
many other cases of grouping which occur in practice.*

In order to derive the distribution of range in samples randomly grouped in
the above sense we have to return to section (2). In this section we derived the
probability $P(n, h, m-1, \xi)$, giving the chance that the difference between the
centre points $\xi_n$ and $\xi_1$ of the highest and lowest category covered by a sample
of n items is at most (m−1)h, where h is the constant category breadth and
group end-points are given by (1). The frequency distribution of $\xi_n - \xi_1$ which
we may denote by $\phi(n, h, m, \xi)$ is then given by
$$\phi(n, h, m, \xi) = P(n, h, m, \xi) - P(n, h, m-1, \xi),$$
and represents the chance that $\xi_n - \xi_1$ is exactly mh, given a particular value of $\xi$.
The corresponding frequency distribution for random grouping may be denoted
by $\bar\phi(n, h, m)$. To derive it we may apply Bayes’s Theorem and obtain
$$\bar\phi(n, h, m) = \frac{1}{h}\int_{X-\frac12 h}^{X+\frac12 h}\phi(n, h, m, \xi)\,d\xi.$$
The resulting cumulative probability may therefore be defined by
$$\bar P(n, h, m) = \sum_{j=0}^{m}\bar\phi(n, h, j) = \frac{1}{h}\int_{X-\frac12 h}^{X+\frac12 h}\sum_{j=0}^{m}\phi(n, h, j, \xi)\,d\xi,$$
which yields the identical relation for the cumulative probabilities
$$\bar P(n, h, m) = \frac{1}{h}\int_{X-\frac12 h}^{X+\frac12 h} P(n, h, m, \xi)\,d\xi.$$


* In certain cases, when grouping is very coarse, it may be advantageous to use an estimate
of the population mean $X$ (either dependent or independent of the sample whose range is
considered). The increase in information is akin to that given by an ancillary statistic in estimation
theory. However, from the results in §3 it would appear that little information is gained where
the grouping interval is small or moderate.








344 The range in random samples 


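Substituting, now, the expression (3) for $P(n, h, m, \xi)$ we obtain

$$P(n, h, m) = \frac{1}{h} \int_{X - h/2}^{X + h/2} \sum_{i=-\infty}^{+\infty} \left\{ \left( \int_{\xi + ih}^{\xi + (i+m+1)h} f(x)\, dx \right)^{n} - \left( \int_{\xi + (i+1)h}^{\xi + (i+m+1)h} f(x)\, dx \right)^{n} \right\} d\xi$$

$$= \frac{1}{h} \int_{-\infty}^{+\infty} \left\{ \left( \int_{\xi}^{\xi + (m+1)h} f(x)\, dx \right)^{n} - \left( \int_{\xi}^{\xi + mh} f(x)\, dx \right)^{n} \right\} d\xi. \tag{21}$$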


This formula may be reduced to a simpler form in which its relation to the
probability integral of the true range (17) becomes apparent. We introduce the
second integral

$$F_n(W) = \int_{0}^{W} P_n(w)\, dw = \int_{0}^{W} \int_{-\infty}^{+\infty} n f(\xi) \left( \int_{\xi}^{\xi + w} f(x)\, dx \right)^{n-1} d\xi\, dw. \tag{22}$$


The first integration is with regard to $w$ and the integrand is an integral with
regard to $\xi$. If, now, in this latter integral $\eta = \xi + w$ is used as variable of
integration in place of $\xi$ we have

$$F_n(W) = \int_{0}^{W} \int_{-\infty}^{+\infty} n f(\eta - w) \left( \int_{\eta - w}^{\eta} f(x)\, dx \right)^{n-1} d\eta\, dw.$$
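We now note that the integrand may be written as a differential with regard to $w$.
Thus, interchanging the order of integration we obtain

$$F_n(W) = \int_{-\infty}^{+\infty} \int_{0}^{W} \frac{\partial}{\partial w} \left[ \left( \int_{\eta - w}^{\eta} f(x)\, dx \right)^{n} \right] dw\, d\eta = \int_{-\infty}^{+\infty} \left( \int_{\eta - W}^{\eta} f(x)\, dx \right)^{n} d\eta, \tag{23}$$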

thus eliminating integration with regard to $w$. The surviving variable of
integration $\eta$ may now be replaced by $\xi = \eta - W$ so that we reach the final result

$$F_n(W) = \int_{-\infty}^{+\infty} \left( \int_{\xi}^{\xi + W} f(x)\, dx \right)^{n} d\xi. \tag{24}$$


We now observe that this integral is identical with the one occurring in the
expression (21) for $P(n, h, m)$, and we note the relation

$$P(n, h, m) = \frac{1}{h} \left\{ F_n\big((m+1)h\big) - F_n(mh) \right\} = \frac{1}{h} \int_{mh}^{(m+1)h} P_n(w)\, dw. \tag{25}$$


This simple formula makes it possible to obtain the effect of random grouping on
the probability integral of range $P_n(W)$. In particular, for normal samples for
which $P_n(W)$ has been tabulated at the fine interval of $\Delta w = 0.05$, the second
integral $F_n(W)$ is easily obtained numerically by summation of the tabular entries
in the table on pp. 302-7 above.
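
Relation (25) can be tried out numerically without the printed table. The sketch below is an illustration only. It assumes a unit normal parent, recomputes $P_n(w)$ by Simpson's rule instead of reading the tabulated values, and uses hypothetical function names (`range_cdf`, `second_integral`, `grouped_range_cdf`) and arbitrary integration limits of $\pm 8$. It evaluates $P(n, h, m)$ both as the mean of $P_n(w)$ over $(mh, (m+1)h)$ and as a difference of the second integral (24).

```python
import math


def phi(x):
    """Unit normal density (assumed parental distribution)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def Phi(x):
    """Unit normal probability integral."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def simpson(g, a, b, steps):
    """Composite Simpson rule; `steps` must be even."""
    width = (b - a) / steps
    total = g(a) + g(b)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * g(a + k * width)
    return total * width / 3.0


def range_cdf(n, w):
    """P_n(w): probability integral of the true range in samples of n."""
    if w <= 0.0:
        return 0.0
    return simpson(lambda x: n * phi(x) * (Phi(x + w) - Phi(x)) ** (n - 1),
                   -8.0, 8.0, 400)


def second_integral(n, W):
    """F_n(W) evaluated from formula (24)."""
    return simpson(lambda xi: (Phi(xi + W) - Phi(xi)) ** n,
                   -8.0 - W, 8.0, 400)


def grouped_range_cdf(n, h, m):
    """P(n, h, m) from relation (25), as the mean of P_n over (mh, (m+1)h)."""
    return simpson(lambda w: range_cdf(n, w), m * h, (m + 1) * h, 10) / h


n, h = 5, 1.0                      # illustrative sample size and breadth
for m in range(6):
    direct = grouped_range_cdf(n, h, m)
    via_F = (second_integral(n, (m + 1) * h) - second_integral(n, m * h)) / h
    print(m, round(direct, 4), round(via_F, 4))
```

The two columns should agree to within the accuracy of the quadrature, which is the content of (25).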



















For $h \to 0$, $m \to \infty$, $mh \to W$, we note that, as expected, $P(n, h, m) \to P_n(W)$;
that is, as grouping becomes finer and finer the probability integral of the grouped
range tends to that of the true range. What is not quite obvious, however, is the
identity of mean grouped range and mean true range, no matter how large the
breadth, $h$, of the randomly placed groups. This point we shall now examine.

If we denote by $\bar{w}(n, h)$ the mean range in samples of $n$ items classified in
groups of breadth $h$ randomly placed, we have by definition

$$\bar{w}(n, h) = \sum_{m=0}^{\infty} \phi(n, h, m)\, mh = \sum_{m=0}^{\infty} \{ P(n, h, m) - P(n, h, m-1) \}\, mh. \tag{26}$$


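We now introduce central differences of the function $F_n(W)$ and use the notation

$$\Delta'_{m+\frac{1}{2}} = F_n\big((m+1)h\big) - F_n(mh), \qquad \Delta''_{m} = F_n\big((m+1)h\big) - 2F_n(mh) + F_n\big((m-1)h\big), \tag{27}$$

so that we obtain from (25), (26) and (27)

$$\bar{w}(n, h) = \sum_{m=1}^{\infty} m\, \Delta''_{m}.$$

We may now write

$$\bar{w}(n, h) = \lim_{M \to \infty} \sum_{m=1}^{M} m\, \Delta''_{m} = \lim_{M \to \infty} \left[ M \Delta'_{M+\frac{1}{2}} - \sum_{m=0}^{M-1} \Delta'_{m+\frac{1}{2}} \right] = \lim_{M \to \infty} \left\{ M \Delta'_{M+\frac{1}{2}} - F_n(Mh) \right\}. \tag{28}$$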


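On the other hand, we have for the mean true range ($\bar{w}_n$, say)

$$\bar{w}_n = \int_{0}^{\infty} w f_n(w)\, dw,$$

where $f_n(w) = \frac{d}{dw} P_n(w)$ is the distribution function of the true range. We,
therefore, have

$$\bar{w}_n = \lim_{M \to \infty} \int_{0}^{Mh} w f_n(w)\, dw = \lim_{M \to \infty} \left\{ Mh\, P_n(Mh) - \int_{0}^{Mh} P_n(w)\, dw \right\}. \tag{29}$$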


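On taking the difference of (28) and (29) we see that

$$\bar{w}(n, h) - \bar{w}_n = \lim_{M \to \infty} Mh \left\{ \frac{1}{h} \Delta'_{M+\frac{1}{2}} - P_n(Mh) \right\}.$$

Now

$$\frac{1}{h} \Delta'_{M+\frac{1}{2}} = \frac{1}{h} \int_{Mh}^{(M+1)h} P_n(w)\, dw = P_n(Mh) + f_n(w^*)\, h^*,$$

where $Mh < w^* < (M+1)h$ and $h^* < h$, so that

$$| \bar{w}(n, h) - \bar{w}_n | < \lim_{M \to \infty} f_n(w^*)\, M h^2.$$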







From this relation it is obvious that $\bar{w}(n, h) = \bar{w}_n$, since $f_n(w)$ has contact of at
least second order as $w \to \infty$.*
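
The identity of the two means can also be checked by simulation. The following is a minimal Monte Carlo sketch, not part of the original argument. It assumes a unit normal parent, draws a fresh random origin for every sample, and uses an illustrative function name (`simulate_means`) together with arbitrary values of `n`, `h`, the number of trials and the seed. A deliberately coarse breadth of twice the parental standard deviation is chosen to show that the agreement does not depend on $h$ being small.

```python
import math
import random


def simulate_means(n, h, trials, seed=0):
    """Return (mean true range, mean grouped range) over simulated samples
    from a unit normal parent with a randomly placed grouping origin."""
    rng = random.Random(seed)
    true_total = 0.0
    grouped_total = 0.0
    for _ in range(trials):
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        xi = rng.uniform(0.0, h)           # fresh random origin each time
        lo, hi = min(sample), max(sample)
        true_total += hi - lo
        # centres of the extreme categories differ by a whole number of h
        grouped_total += (math.floor((hi - xi) / h)
                          - math.floor((lo - xi) / h)) * h
    return true_total / trials, grouped_total / trials


true_mean, grouped_mean = simulate_means(n=5, h=2.0, trials=20000)
print(true_mean, grouped_mean)
```

The two printed means should differ only by sampling error, in line with the result just proved.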

We have proved, therefore, that if grouping is random the expectation† of
the mean range $\bar{w}(n, h)$ (mean grouped range) is identical with the true mean range
$\bar{w}_n$ so that no bias is introduced through random grouping, no matter how large
the grouping interval $h$. However, if we wish to use $\xi_n - \xi_1$ as an estimate of $\bar{w}_n$,
this estimate, although unbiased in the sense defined, becomes less and less
reliable as $h$ increases. This is borne out by its random sampling distribution or
its probability integral $P(n, h, m)$ given by (25). For normal samples it is an easy
matter to tabulate $P(n, h, m)$ from the table of $P_n(W)$ (pp. 302-7) and thus to
follow up the numerical increase of its standard deviation as $h$ increases. However,
to cover the case of a general parental distribution $f(x)$, we shall derive an
analytical formula for the variance of $\xi_n - \xi_1$ from which approximate numerical
results are easily obtained.

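In order to obtain this formula we consider the second moment of $\xi_n - \xi_1$
which we may denote by $\mu_2(n, h)$. We have by definition and from equations (25),
(26) and (27)

$$\mu_2(n, h) = \lim_{M \to \infty} \sum_{m=0}^{M} m^2 h^2\, \frac{1}{h} \Delta''_{m} = \lim_{M \to \infty} \sum_{m=0}^{M} h\, m\big(m + \tfrac{1}{2}\big) \Delta''_{m} - \tfrac{1}{2} h\, \bar{w}_n. \tag{30}$$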
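Now we may write

$$\begin{aligned}
\sum_{m=0}^{M} m\big(m + \tfrac{1}{2}\big) \Delta''_{m}
&= \sum_{m=0}^{M} m \left\{ \big(m + \tfrac{1}{2}\big) \Delta'_{m+\frac{1}{2}} - \big(m - \tfrac{1}{2}\big) \Delta'_{m-\frac{1}{2}} \right\} - \sum_{m=0}^{M} m\, \Delta'_{m-\frac{1}{2}} \\
&= M\big(M + \tfrac{1}{2}\big) \Delta'_{M+\frac{1}{2}} - \sum_{m=0}^{M-1} \big(m + \tfrac{1}{2}\big) \Delta'_{m+\frac{1}{2}} - M F_n(Mh) + \sum_{m=0}^{M-1} F_n(mh) \\
&= M\big(M + \tfrac{1}{2}\big) \Delta'_{M+\frac{1}{2}} - \big(2M - \tfrac{1}{2}\big) F_n(Mh) + 2 \sum_{m=0}^{M-1} F_n(mh).
\end{aligned} \tag{31}$$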
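Using equation (31) we obtain for the second moment (30)

$$\mu_2(n, h) = \lim_{M \to \infty} \left[ (Mh)^2\, \frac{1}{h} \Delta'_{M+\frac{1}{2}} \left( 1 + \frac{1}{2M} \right) - 2Mh\, F_n(Mh) \left\{ 1 - \frac{1}{4M} \right\} + 2h \sum_{m=0}^{M-1} F_n(mh) - \tfrac{1}{2} h\, \bar{w}_n \right]. \tag{32}$$

This formula enables us to compare $\mu_2(n, h)$ with $\mu_2(n)$, the second moment of
the distribution of the true range. We have by definition

$$\mu_2(n) = \lim_{M \to \infty} \int_{0}^{Mh} f_n(w)\, w^2\, dw,$$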


* It can be proved that the order of contact of $f_n(w)$ is the same as that of the parental
distribution $f(x)$.

† If repeated samples were drawn from the same population and the same grouping system
used in each case, the mean grouped range would be biased by an unknown amount. But in repeated
experience with different populations the expectation of this bias is zero.







which we transform by two partial integrations into the equation

$$\mu_2(n) = \lim_{M \to \infty} \left[ (Mh)^2 P_n(Mh) - 2(Mh)\, F_n(Mh) + 2 \int_{0}^{Mh} F_n(w)\, dw \right]. \tag{33}$$


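Taking the difference of (32) and (33) we obtain

$$\begin{aligned}
\mu_2(n, h) - \mu_2(n) = {} & \lim_{M \to \infty} \left[ (Mh)^2 \left\{ \frac{1}{h} \Delta'_{M+\frac{1}{2}} \left( 1 + \frac{1}{2M} \right) - P_n(Mh) \right\} - \frac{h}{2} \left\{ F_n(Mh) + \bar{w}_n \right\} \right] \\
& + \lim_{M \to \infty} \left[ 2h \sum_{m=0}^{M-1} F_n(mh) + h F_n(Mh) - 2 \int_{0}^{Mh} F_n(w)\, dw \right].
\end{aligned} \tag{34}$$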


Since the expected mean value of the grouped range is the same as for the 
true range, the difference in second moments about zero equals the difference in 
variances. The first term on the right-hand side of equation (34) is obviously 0 
(see equation (29)), whilst the second term is best evaluated with the help 
of Gregory’s formula for numerical integration (see e.g. L. J. Comrie, 1936, 
p. 809). Using this formula we can express the difference between the integral 
and the finite sum in equation (34) in terms of the differences of the integrand 
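$F_n(w)$ at the two ends of the range of integration. We obtain

$$\begin{aligned}
\mu_2(n, h) - \mu_2(n) &= \lim_{M \to \infty} \left[ \frac{2h}{12} \left\{ \Delta'_{M-\frac{1}{2}} - \Delta'_{0} \right\} + \frac{2h}{24} \left\{ \Delta''_{M-1} + \Delta''_{0} \right\} + \dots \right] \\
&= \tfrac{1}{6} h \left( h - \Delta'_{0} \right) + \tfrac{1}{12} h\, \Delta''_{0} + \dots,
\end{aligned} \tag{35}$$

provided the Gregory expansion is convergent.* $\Delta'_{0}, \Delta''_{0}, \Delta'''_{0}, \dots$ are advancing
differences of the function $F_n(w)$ at $w = 0$.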
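
For readers unfamiliar with Gregory's formula, the sketch below shows the first two end-corrections applied to a trapezoidal sum, which are the two terms written out in (35). It is an illustration only. The test integrand $e^{w}$ on $(0, 1)$, the step $h = 0.1$ and the function name `gregory_integral` are assumptions chosen for convenience and have nothing to do with $F_n(w)$ itself.

```python
import math


def gregory_integral(values, h):
    """Return (trapezoidal sum, sum with two Gregory end-corrections)
    for equally spaced ordinates `values` at spacing h."""
    trapezoid = h * (sum(values) - 0.5 * (values[0] + values[-1]))
    # first and second differences at the two ends of the range
    first_back = values[-1] - values[-2]
    first_fwd = values[1] - values[0]
    second_back = values[-1] - 2.0 * values[-2] + values[-3]
    second_fwd = values[2] - 2.0 * values[1] + values[0]
    corrected = (trapezoid
                 - h / 12.0 * (first_back - first_fwd)
                 - h / 24.0 * (second_back + second_fwd))
    return trapezoid, corrected


h = 0.1
values = [math.exp(k * h) for k in range(11)]   # ordinates on [0, 1]
trapezoid, corrected = gregory_integral(values, h)
exact = math.e - 1.0
print(trapezoid - exact, corrected - exact)
```

The difference between the trapezoidal sum and the integral is thus expressed through differences at the two ends only, which is how the second term of (34) is reduced to (35).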

Equation (35) yields the desired formula for the second moment of $\xi_n - \xi_1$.
For most parental distributions the resulting probability integral of the range
will be practically 0 for a certain range in the neighbourhood of the origin (see for
instance the behaviour of $P_n(W)$ from a normal parent given in the table on
pp. 302-7). For such parents and for moderate $h$ we have

$$\Delta'_{0} \approx \Delta''_{0} \approx \Delta'''_{0} \approx \dots \approx 0,$$

so that

$$\mu_2(n, h) - \mu_2(n) \approx \tfrac{1}{6} h^2. \tag{36}$$


For small or moderate values of $h$, therefore, the increase in variance of the
grouped range is given approximately (and for most parents to a high degree of
accuracy) by $\tfrac{1}{6} h^2$. This increase is double the amount given by the well-known
Sheppard's correction of $\tfrac{1}{12} h^2$. Indeed, had we grouped a sample of true ranges $w$
in fixed categories of breadth $h$, the resulting second moment of the grouped
distribution of $w$ would have an expectation which is $\tfrac{1}{12} h^2$ in excess of the second
moment of the true range. With random grouping of the original sample (as it

* This condition is as a rule fulfilled for values of $h$ which do not exceed the standard deviation
of the parental distribution $f(x)$.













has been defined above in accordance with common practice of grouping) an 
additional uncertainty is introduced by using a new, randomly selected, set of 
group intervals each time a new grouped range is determined. This additional 
uncertainty has been proved roughly to double the excess of the variance and the 
result is an increase of $\tfrac{1}{6} h^2$ over the variance of the true range.
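
The approximate excess of $\tfrac{1}{6} h^2$ in the variance can be examined by simulation. What follows is a rough Monte Carlo sketch under stated assumptions, not a reproduction of any calculation in the paper. The parent is taken as unit normal, the function names (`variance`, `simulate_variances`) are illustrative, and `n`, `h`, the number of trials and the seed are arbitrary.

```python
import math
import random


def variance(values):
    """Plain population variance of a list of numbers."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


def simulate_variances(n, h, trials, seed=0):
    """Return (variance of true range, variance of grouped range) for
    unit normal samples grouped with a randomly placed origin."""
    rng = random.Random(seed)
    true_ranges, grouped_ranges = [], []
    for _ in range(trials):
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        xi = rng.uniform(0.0, h)           # new grouping origin each time
        lo, hi = min(sample), max(sample)
        true_ranges.append(hi - lo)
        grouped_ranges.append((math.floor((hi - xi) / h)
                               - math.floor((lo - xi) / h)) * h)
    return variance(true_ranges), variance(grouped_ranges)


h = 1.0
var_true, var_grouped = simulate_variances(n=5, h=h, trials=40000)
print(var_grouped - var_true, h * h / 6.0)
```

The observed excess should lie close to $h^2/6$, that is about twice the Sheppard value of $h^2/12$, apart from sampling error.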


I wish to acknowledge with gratitude the helpful suggestions and criticisms 
made by Drs J. Wishart and J. O. Irwin and by Professor E. S. Pearson at 
various stages of this investigation. 


REFERENCES 


Comrie, L. J. (1936). Interpolation and Allied Tables. H.M.S.O.
McKay, A. T. (1935). Biometrika, 27, 466-71.
McKay, A. T. & Pearson, E. S. (1933). Biometrika, 25, 415-20.
Pearson, E. S. (1926). Biometrika, 18, 173-94.
Pearson, E. S. (1932). Biometrika, 24, 404-17.
Pearson, E. S. & Haines, J. (1935). J.R. Statist. Soc. Suppl. 2, 83-98.
Tildesley, M. L. (1940). Man, 40, 180-9.
Tippett, L. H. C. (1925). Biometrika, 17, 364-87.
















MISCELLANEA 


(i) The Second Yearbook of Research and Statistical Methodology Books 
and Reviews. Edited by Oscar Krisen Buros. The Gryphon Press, 
Highland Park, New Jersey, 1941. $5. 


This is a second and much enlarged issue of a volume published in 1938. It contains 
nearly seventeen hundred review excerpts on 346 statistical and allied books (in the 
English language only), extracted from 283 different journals. The editor has attempted 
with considerable success to cover the whole field of statistical and probability theory, as 
well as their applications in every possible direction. He has also included reviews of a 
number of books on the general history of science, on scientific method and on the social 
relations of science on the ground that they are—or should be—of general interest to 
scientific workers in every special field. Included in this category are books such as 
J. D. Bernal’s The Social Function of Science, J. G. Crowther’s The Social Relations of 
Science and J. B. S. Haldane’s The Marxist Philosophy and the Sciences. 

This large volume of several hundred pages has been admirably produced and arranged. 
It is intended to publish a fresh volume every two years containing reviews that have 
appeared in the interval. The Preface sets out a variety of reasons which, in the Editor’s 
opinion, justify the present venture and even its enlargement in the future if sufficient 
support is forthcoming; at the same time frank expressions of opinions are asked for from 
readers and reviewers. 

The objectives of the Yearbook as set out may be classed under four general heads: 

(1) To help students, teachers and librarians to select text-books with greater dis- 
crimination and to point out to them the weak and strong points of particular books. 

(2) To indicate the width of the subject of statistics and the many fields in which it 
is applied. 

(3) To make students and teachers aware of the inadequacy of much that is now pre- 
sented in text books and classes; to discourage the publication of books written by persons 
ignorant of the latest developments in their subject. 

(4) To improve the quality of reviews by encouraging editors and reviewers alike to 
take their responsibilities more seriously. 

With the last three objectives it is hardly possible to quarrel, and it is likely that the 
wide circulation of this volume would provide one of the most direct methods of attaining 
these ends. The first objective is, however, presumably the most important, and there are 
bound to be differences of opinion on the probable success of the book in this direction. In 
the ordinary event the teacher will no doubt be made aware of new books in the field with 
which he is concerned by reading the notices in one or two journals specially devoted to his 
subject. Having obtained a suggestion of a likely book he must surely get hold of it and 
determine by reading it himself whether it is suitable and abreast of the latest develop- 
ments. If he is not competent to do this, but must base his decision on the advice of 6-10 
reviewers, it seems doubtful whether he should be teaching the subject at all. 

After reading through the reviews on some dozen books contained in the present 
Yearbook, I am inclined to the following conclusions. Regarding books of outstanding but 
perhaps rather controversial character, as those of Harold Jeffreys and Richard von Mises, 
the reader will certainly gain a useful impression from the collected reviews. This is partly 
because in such cases the standing of the reviewers is high and their reviews interesting and 
fairly written, even if critical. But in the case of the more elementary text book, the 
position is rather different. Quite often the opinions expressed are diametrically opposite. 
In cases where I knew nothing of the book or its author I found myself inevitably forced 
to form an opinion from my own personal knowledge of the experience, the special interests
and even the character of the reviewer. Such inside information will generally not be
possessed by the College instructor and certainly not by the student. We cannot, I think,
escape the conclusion that the teacher who has to select a text book for his students must 
be competent to decide on its merits himself and, if he is not, he may be only confused by 
the varied opinions contained in the Yearbook. 

Three future directions in which the volume might be enlarged are contemplated: 

(1) The inclusion of reviews of foreign language (i.e. not English) books. 

(2) The addition of a section devoted to non-critical abstracts of periodical literature 
on research and statistical methods. 

(3) The publication of original criticisms by one or more persons (according to the 
importance and controversial nature) of articles and papers in the periodical literature. 

The first addition is clearly desirable; the publication of translations of reviews in 
foreign journals of our own American and British books would probably be useful too; it 
would help us to see ourselves as others see us. With regard to the second and third pro- 
posals, the great difficulty is of course to secure the services of sufficiently competent 
abstractors or critics for so large an undertaking. If, quoting the Editor, the statistical 
student and teacher are to be kept ‘abreast of modern developments in statistical theory’; 
to be warned ‘to ignore much of the literature which either presents nothing new or presents 
inefficient or incorrect methods of statistical analysis’; to be told what are ‘sloppy, value- 
less, and erroneous articles’ and what are ‘well-written, significant contributions’, it is 
clear that a very great responsibility will lie on the Editor of the Yearbook and his col- 
laborators. As Prof. Buros indicates, the organizing and editing of such a comprehensive 
service would need the support of a foundation interested in fostering the advancement of 
research. Indeed, to avoid duplication the organization must be built up on an inter- 
national basis, possibly in collaboration with such bodies as the American Statistical Asso- 
ciation and the Royal Statistical Society between whose representatives some discussion 
of a similar project took place a few years ago. E.S.P. 

























